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Abstract 

We construct a finitely presented group with coNP-complete word problem, and a 
finitely generated simple group with coNP-complete word problem. These groups are rep- 
resented as Thompson groups, hence as partial transformation groups of strings. The proof 
provides a simulation of combinational circuits by elements of the Thompson-Higman 
group G 3j i. 

1 Introduction 

There are many open problems in computational complexity, e.g., the famous questions "P 
7^ NP ?", and "NP 7^ coNP ?", that are believed to be very difficult. One way to approach 
very difficult problems is to relate them to other disciplines. For computational complexity 
there are interesting relations with combinatorial group theory. An early connection was Max 
Dehn's formulation of the word problem of a group (1910). It took 45 years until it was 
shown that there is a finitely presented group whose word problem is undecidable, and that 
certain finitely presented groups can simulate universal Turing machines (Novikov 1955, Boone 
1954-57). Soon after, Higman's embedding theorem (1961) gave an algebraic characterization 
of recursive enumerability of the word problem of a group G (namely, G has a recursively 
enumerable word problem iff G is isomorphic to a subgroup of some finitely presented group). 
Boone and Higman (1976) gave an algebraic characterization of decidability of the word problem 
of a group G (namely, G has a decidable word problem iff G is isomorphic to a subgroup of 
some simple group, which itself is a subgroup of some finitely presented group). It was also 
proved that some finitely presented groups have a primitive recursive word problem; in fact, 
Madlener and Otto JH] gave a version of the Higman embedding theorem that preserves the 
Grzegorczyk hierarchy from level 3 upward. Madlener and Otto also introduced what was later 
called the isoperimetric function of a group. 

It has long been folklore knowledge that (un) decidability, recursive enumerability, primitive 
recursiveness, and the Grzegorczyk level, of the word problem of a finitely presented group G is 
an algebraic property of the group, i.e., if one changes over to a different finite set of generators 
of the same group, the property is preserved. Madlener and Otto showed that the isoperimetric 
function of a group changes only linearly when one changes the finite presentation of the group. 

*Research supported in part by NSF grant DMS-9970471, and in part by NSERC grant 216872-1999 



1 



A similar argument shows that the computational complexity (time or space, deterministic, 
nondeterministic, or co-nondeterministic) of a group changes only linearly under change of fi- 
nite generating set. So, combinatorial group theory gives us the following advantage over the 
ordinary formal language formulation of computational complexity: algebraic invariance. Com- 
plexity is a property of the group, no matter how the group arizes (as words in a presentation, 
or as transformations of a space, or a set with a composition operation). Note, however, that 
this invariance only holds as long as we stick to finite generating sets. 

It was also shown [3] that every decision problem L can be reduced (by a one-to-one linear- 
time reduction) to the word problem of some finitely generated group Gl, with the property 
that this word problem has the same time complexity (up to a linear factor n) as the problem 
L. (This was proved for deterministic and nondeterministic time complexity, but the proof 
for the deterministic case also works for co-nondeterministic time complexity.) So, the word 
problem for finitely generated groups is as general as decision problems overall, as far as time 
complexity is concerned. Also, as a consequence, there exist finitely generated groups whose 
word problem is NP-complete, or coNP-complete. 

The word problem of finitely presented groups is naturally related to nondeterministic time 
complexity; indeed, for a finite presentation (A,R), a word w over A ±l is equivalent to e 
(the empty word) iff there exists a rewrite sequence consisting of applying relators in R; this 
rewrite process can be "guessed" and carried out by a nondeterministic Turing machine. More 
precisely, there is a close connection between the isoperimetric function and nondeterministic 
time complexity. In |2S] arid [Zj it was shown that the word problem of a finitely generated group 
G is in NP iff G is embeddable in a finitely presented group whose isoperimetric function is 
polynomially bounded. This implies that there exist finitely presented groups with NP-complete 
word problem. The theorem extends to other nondeterministic time complexity classes. A 
semigroup version of this result had been proved earlier It was also shown in j2H| that a 
function / (with f(n) > n 4 ) is an isoperimetric function if fin) 4 is the time complexity of a 
nondeterministic Turing machine and if / is superadditive (i.e, f(x + y) > fix) + f(y))- In 
particular, all functions n a with a > 4 (a G Q) are isoperimetric functions. Later, Brady and 
Bridson 9J also proved that n a is an isoperimetric function for all a ranging over a countable 
dense set of real numbers > 2. See also Section 3 of jS]. On the other hand, there is no 
isoperimetric function between n and n 2 . More precisely, if an isoperimetric function / satisfies 
fin) = o(n 2 ) then fin) = O(n); this is known as the Gromov gap. Groups with linear 
isoperimetric function are called "word hyperbolic" (see (HI); they have shown up in many 
situations, and they have many special properties (e.g., their word problem can be decided 
in linear time by a deterministic Turing machine). In summary, the study of connections 
between combinatorial group theory and nondeterministic time complexity has been successful, 
especially for combinatorial group theory, regarding isoperimetric functions. 

In this paper we look at connections between co-nondeterministic time complexity and 
combinatorial group theory. By definition, a problem L (represented by a formal language) 
is in coNTime(T) iff L is accepted by a co-nondeterministic Turing machine in time T. A 
co-nondeterministic Turing machine is a Turing machine M which is allowed to make choices 
(just like a nondeterministic Turing machine), but which uses the following acceptance rule: 
a word w is accepted by M iff all computation paths of M with input w lead to an accept 
state. So, "for-all" is used instead of nondeterminism's "there-exists" . An equivalent definition 
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is that coNTime(T) consists of the languages whose complement is in NTime(T). Most books 
on computational complexity discuss coNTime(T) and coNP; see e.g., [321 • There are many fa- 
mous coNP-complete decision problems, that are as significant as the well-known NP-compete 
problems (although NP is much more popular than coNP in Computer Science). Here is a 
sampling: 

• The tautology problem: Given a boolean formula, is it a tautology, i.e., is it true for all truth- 
value assignments? Informal versions of this problem goes back to antiquity; the tautology 
problem is "the decision problem of boolean logic" . 

• The circuit equivalence problem: Given two acyclic boolean circuits (also called "combina- 
tional circuits"), do they have the same input-output function? 

• Integer linear programming equivalence problem: Given two instances of integer linear pro- 
gramming, do they have the same set of feasible solutions? 

• The 4-coloring problem: Given a planar graph, do we need four colors to vertex-color it? (Note 
that every planar graph is 4-colorable, and the question whether a planar graph is 3-colorable 
is NP-complete.) 

• Connectivity lower-bound: Given a graph and an integer k, is the connectivity of the graph 
greater than k? Equivalently, does the graph remain connected when any k edges are removed? 

Since there is a close connection between nondeterminism and finitely presented groups, as 
we saw, and since NP is believed to be different from coNP, one might expect at first that 
there is no natural connection between co-nondeterminism and combinatorial group theory. 
However, if we take transformation groups as our starting point we see a hint at a connection: 
In a transformation group two elements g\ and g 2 (permutations) are equal iff g\{x) = g2{x) 
for all x in the action space. Here again the for-all quantifier shows up, which corresponds to 
co-nondeterminism. In order to investigate the complexity of problems about transformation 
groups, it is convenient to consider groups of transformations of words (i.e., strings over a 
finite alphabet). The groups introduced by Richard Thompson [30J in the 1960s turn out to be 
appropriate for this, not only based on their nice definition, but also based on their history: they 
were used for constructing finitely presented groups with undecidable word problem [2D] , and for 
proving a stronger form of the Boone-Higman theorem [3T] . Below we give some background on 
these groups. Note that here we do not view the Thompson groups as a special class of groups 
(as is usually done in the literature), but as a general formalism for describing all countable 
groups; in fact, all subgroups of (5^ can be represented as Thompson groups (@n denotes the 
group of all permutations of the natural integers). 

In order to achieve coNP-hardness we show that every acyclic circuit can be "simulated" by 
an element of a particular Thompson group (namely the finitely presented Thompson-Higman 
group G 3) i, defined below). So, we simulate a circuit by a permutation of strings over the 
3-letter alphabet {0,1, The simulation is such that two circuits are equivalent iff their 
simulating permutations are equal when restricted to all strings that start with 0. Technically, 
the Thompson group elements are partial permutations of {0, 1, #}* that map certain maximal 
prefix codes bijectively to maximal prefix codes (see the background on Thompson groups be- 
low). This simulation is a polynomial-time many-to-one reduction from the circuit equivalence 
problem (which is coNP-complete) to a problem about the Thompson-Higman group G^ x- in 
a succession of steps (see the more detailed outline of the paper below), we reduce the latter 
problem to the word problem of another finitely presented Thompson group. We also reduce 
this problem to the word problem of a finitely generated simple group (and we conjecture that 
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this simple group is actually finitely presented). Moreover, we show that all the groups above 
have their word problem in coNP. 

Our simulation of acyclic circuits by group elements is similar to the construction of a 
reversible circuit. This connects this paper with the classical topic of reversible computation 
(see ^H], for reversible Turing machines, and ^T] for reversible acyclic circuits). In our 
case the result is stronger, since we do not just get reversibility but a finitely presented group. 

Another motivation for this paper is a conjecture attributed to Higman about a stronger 
form of the Boone-Higman theorem. The conjecture is that a finitely generated group G has 
decidable word problem iff G is embeddable into a finitely presented simple group. (It is well 
known that every finitely presented simple group has a decidable word problem.) 

A consequence of this conjecture would be that the word problem of finitely presented simple 
groups can have arbitrarily large time complexity. This means that for every function T which 
is the time complexity of a deterministic Turing machine, there is a finitely presented simple 
group whose word problem cannot be decided in time < T. (Indeed, by finitely generated 
groups G have arbitrarily high complexity; moreover, a finitely generated subgroup G of a group 
S cannot have higher complexity than 5*, up to linear changes in the complexity function.) 

On the other hand, all known finitely presented simple groups have word problems with 
rather low complexity (in the cases where the complexity has been analyzed in detail it always 
turned out to be in the complexity class P). In that connection, see [21] and also [12], [To"] . 
So, one might ask the opposite question: Is there some cap on the computational complexity 
of the word problem of finitely presented simple groups? At the moment, neither Higman's 
conjecture nor the opposite question have much evidence in their favor (and, a priori, they 
could both be wrong). A contribution of this paper, in the direction of Higman's conjecture, 
is the construction of a finitely generated simple group with coNP-complete word problem; we 
conjecture that this group is also finitely presented. 

Some background and notations on the Thompson groups 

The Thompson groups, introduced by Richard Thompson in the 1960s [SOI EI], provided the 
first known examples of simple finitely presented infinite groups. Although Thompson defined 
his groups as permutation groups of certain sets of infinite words over the alphabet {0, 1}, 
we prefer the approach of E. Scott [27J and G. Higman JT3], which enables us to define the 
Thompson groups as partial actions on the words over a finite alphabet. The advantage of 
finite words is that algorithmic problems and their complexity can be defined in a direct way. 

Let us introduce some terminology; we have made an effort to stay close to classical or 
widely used concepts. More details (and proofs) can be found in |0], and often also in [21], [T3| . 
and [SI]. For a finite alphabet A, the set of all words over A (including the empty word e) is 
denoted by A*. We will assume from now on that A has a least two letters. Concatenation 
of two words u,v G A* is denoted by u ■ v or uv; A* is a monoid under concatenation. For 
Xi,X2 C A* the concatenation is X\ ■ X2 = X1X2 = {X1X2 G A* : x\ G X\,X2 G X2}. A right 
ideal of A* is defined to be a subset RCA* such that R ■ A* C R (i.e., R is closed under 
concatenation by any word in A* on the right). For two words u, v G A*, we say that u is a 
prefix of v iff v = ux for some x G A*; we also write u > pre f v or v < pre f u; this is a partial 
order, related to set inclusion by the fact that v < pre f u iff vA* C uA* . We say that u and 
v are prefix- comparable iff v < pre f u or u < pre f v; we denote this by u ^ pre f v. A prefix code 



4 



over A is defined to be a subset C of A* such that no element of C is a strict prefix of another 
element of C . A maximal prefix code over an alphabet A is a prefix code over A which is not a 
strict subset of any other prefix code over A. For a right ideal P of A*, a set F C P is called a 
set of right-ideal generators of P iff P = T ■ A*. One can prove that any right ideal P of ^4* 
has a unique minimal (under inclusion) set of right-ideal generators, and this set of generators 
is a prefix code. Right ideals of A* and prefix codes over A are in one-to-one correspondence. 
A right ideal R of A* is said to be finitely generated iff the prefix code corresponding to R is 
finite. A right ideal R of A* is called essential iff R has a non-empty intersection with every 
right ideal of A*. One can prove that a right ideal is essential iff its prefix code is a maximal 
prefix code. 

A right-ideal homomorphism of A* is defined to be a function (p : R\ — > R2 such that Ri 
and i?2 are right ideals of A*, and such that for all u G Pi and all x G A*: ip(u) ■ x — ip(ux). 
A right-ideal isomorphism of A* is a bijective right-ideal homomorphism. The set of all right- 
ideal homomorphisms (or isomorphisms) of A* is in one-to-one correspondence with the set of 
all functions (respectively bijections) between prefix codes of A*. For a right-ideal isomorphism 
ip : PiA* — ► P2A*, where Pi and P2 are prefix codes, the restriction t v : P\ — » P2 is a bijection, 
and determines 9? uniquely. Following Thompson, the restriction : Pi —>■ P 2 of 9? will be 
called the to&ie 0/ (p, and will be used to represent p by a traditional function table. (In 
and j22j this was called the "symbol of </?".) The maximal prefix code Pi is called the domain 
code of ip, and P2 is called the image code or range code of </?. An extension of a right-ideal 
isomorphism y> : Pi — > R2 is defined to be a right-ideal isomorphism $ : Ji — > J2 where Ji, J2 
are right ideals such that Pi C J 1( P 2 C J 2 , and $ agrees with ip> on Pi (i.e., $(x) = <p(x) for 
all x G Pi). In that case we also call (p a restriction of $. A right-ideal isomorphism is said to 
be maximal iff it has no strict extension in A*; it is called extendable otherwise. We denote the 
maximum extension of <p by max if; one can prove (see j2Zj or [U]) that the maximum extension 
of an isomorphism between essential right ideals is unique. 

The above concepts can be pictured using trees. The monoid A* can be described by the 
Cayley graph of the right regular representation of A* relative to the generating set A. We will 
simply call this the tree of A*. It is an infinite tree rooted at the empty word e. Every vertex 
has \A\ children. Every subset of A* is pictured as a set of vertices of this infinite tree. A prefix 
code is pictured as a set of vertices, no two of which lie on a same directed path from the root. 
For any prefix code P C A* (P 7^ 0), the prefix tree of P is defined to be the subtree of the tree 
of A*, whose vertex subset consists of all the prefixes of words in P (and whose root is still e). 
Hence, the set of leaves of this subtree is P. 

One can prove (see [27| or j^j) that an isomorphism of finitely generated essential right 
ideals (p : Pi A* — > P 2 A*, with Pi and P2 finite maximal prefix codes, is extendable iff there 
are Xo,2/o G A* such that for every letter a G A: xoa G Pi, yoa G P2, and (p(xoat) = y^a. 
(If this condition holds, <p can be extended by mapping xo to yo.) More generally (see |6J, an 
isomorphism of (not necessarily finitely generated) essential right ideals (p : P\A* — > P 2 A*, with 
Pi and P2 arbitrary maximal prefix codes, is extendable iff there are xq, yo G A* and there exists 
a maximal prefix code Q C A* with \Q\ > 1 such that for all q G Q : x q G Pi, y$q G P2, and 
(p(x q) = y q. 

We now define the Thompson groups, following the approach of Scott [2Zj and Higman |14j . 
The tree representation of codes connects this definition and the definition by action on finite 
trees used in [TO]. The Thompson-Higman group G^,i is the partial action group on A* 
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(for some fixed alphabet A with \A\ = N), consisting of all maximal isomorphisms between 
finitely generated essential right ideals of A*. The Thompson-Higman group Qn,i is the partial 
action group on A* consisting of all maximal isomorphisms between essential right ideals of A*. 
Multiplication in Qn,i, and hence in the subgroup Gn,i and in any subgroup of Qn,i, is defined 
as follows: For (p, ip e Qn,i the product (p • ip is max(p o x/j) (i.e., the maximum extension of the 
composition of ip and p, where ip is applied first). In general, in this paper, we apply (partial) 
functions on the left of the argument, and hence compose functions from right to left. 

In this paper we call any partial transformation subgroup of Qn,i (for any integer N > 2) 
a Thompson group. (This is a slight misnomer, since these groups are actually more than just 
groups; they are partial transformation groups.) It is easy to see that every countable group 
is isomorphic to a Thompson group; in fact (see e.g. [§]), every subgroup of &i (the group of 
all permutations of the integers) can be represented as a Thompson group. It is remarkable 
that Thompson groups consist of partial transformations; it is the uniqueness of the maximal 
extension that enables them, nevertheless, to be groups. 

Main results 

In this paper we use polynomial-time constant- arity conjunctive reduction (instead of many- 
to-one reduction). This is defined in Definition 15.11 The complexity classes P, NP, coNP, as 
well as most other common complexity classes containing P, are closed under this reduction. 

Theorem 1.1 There exists a finitely presented group G whose word problem is coNP-complete 
(with respect to polynomial-time constant- arity conjunctive reduction). 

Moreover, we have: 

• The group G is explicitly embedded into Q^i as G = (G™° d3 (0, 1; #) U{k 32 i}) (see Theorem 
\8.3\) . The subgroup G™° d3 (0, 1; #) of G^\ is defined in Definition ^. J\ and at the end of Step 1 
below, and is finitely presented. The element K321 = K3K2K1 £ Gz,i is defined in Section 2. 

• G is an HNN extension (by one stable letter) o/G™° d3 (0, 1; #). Moreover, G is isomorphic 
to a semidirect product G™° d3 (0, 1; jf) X Z. 

Theorem 1.2 There exists a finitely generated simple group S whose word problem is coNP- 
complete (with respect to polynomial-time constant- arity conjunctive reduction). 

The group S is explicitly embedded into & 3) i as S = (G 3< i U {k , Ki, k 2 })', i.e., the commutator 
subgroup of (G*3,i U {k , k 1; k%}) (Theorem I8.5|) . where k , k\, and k 2 are elements of 
defined in Section 2. Moreover, S has finite index in (G^i U {k , k%, ^2})- 

We conjecture that (£3,1 U {k q , Ki, and hence S, is not only finitely generated but also 
finitely presented. This would give us a finitely presented simple group with coNP-complete 
word problem. 

Overview of the paper 

• Step 1 (Sections 2 and 3): 

Recall that G^i is the Thompson-Higman group of right-ideal isomorphisms between finitely 
generated essential right ideals of the free monoid {0, 1, #}*. It is well known that G^i is finitely 
presented ^^; let A^i be a finite generating set of G^i. We give a polynomial-time many-to-one 
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reduction of the circuit equivalence problem to the following u word problem with restriction" in 
the Thompson-Higman group G 3jl : 

INPUT: Two words u, v over A^} U {r^+i : i > 0}, where r^j+i is the element of that 
transposes the bits in positions % and % + 1 in any string XqX\ . . . . . . # G {0, 

Question: Are the two elements of G 3i i, represented by u, v, equal when restricted to the 
subset 0{0, 1}*# of {0,1, #}* ? 

In order to find the above reduction, we first represent the circuit components by elements 
of G 3:1 : and, OR, not, as well as wire forking (i.e., duplication or copying of variables), and 
wire crossing (i.e., permutations of variables); wire crossings are described by the transpositions 

T i,i+1- 

Now let C be any acyclic boolean circuit, with input-output function fc : {0, l} m — > {0, 1}™. 
We simulate C by a Thompson group element $c G G 3: i such that: 

- the action of & c on the subset 0{0, 1}*# represents the function f c in the sense that for all 
x , Xi, . . . , x m G {0, 1} and all to G {0, 1}*: 

#) = Oxi . . . x m /c(xi, . . . , x m ) w #; 

- the word-length of $c over A^} U {r iyi+ i : i > 0}, as well as the largest subscript of the r i:i+ i 
used to represent <3>c, have a polynomial upper bound in terms of the circuit size |C|; 

- a word Wc over A^J U {r^+i : % > 0}, representing $,7) can be computed deterministically in 
polynomial time (in terms of |C|. 

Note that although G 31 is finitely generated, we are using an infinite generating set here 
in order to obtain the word Wq with polynomial length; in fact, t^+i has exponential word- 
length over A 3i i. Eventually we will want a finitely generated (and finitely presented) group 
for representing C. For this we introduce elements k-i G Q 3j i i — 0, 1,2,3 such that each r^+i 
has polynomial word length over A 3i i U {/to, «i, k 2 , k 3 }. However, Ki does not belong to G 3 ,i- 

We observe that $ c and the representatives of the circuit elements belong to the following 
subgroup of G 3 ^\ 

G^ d3 (0,l;#) = 

{0GG 3i i: and <p~ l map {0, 1}* to {0, 1}* and map {0, 1}*# to {0, 1}*#; 
and _1 are defined everywhere on {0, 1}*#; 

moreover, for all x G {0, 1}*, \4>{x)\ = |x| mod 3 when 0(x) is defined}. 

From now on we will usually use G™° d3 (0, 1; #), rather than G 3il . Later we will prove that 
Gff 3 (0, 1; #) is finitely presented. 

• Step 2 (Section 4): 

It follows from step 1 that two circuits Ci, C2 are equivalent iff c&^^Ci fixes every point in 
0{0, 1}*# on which <f>^}®Ci is defined. Thus, we have reduced the circuit equivalence problem 
to the generalized word problem of the subgroup pFix(0{0, 1}*#) of G 3 I1 ° d3 (0, 1; Here, for 
any 5 C {0, 1, #}*, pFix(S') denotes the "partial fixator" 

pFix(S) = {0 G G^ d3 (0, 1; #) : fixes all points of 5 on which is defined}. 

In these problems we still represent words over the infinite generating set A^{ U {r i)i+1 : i > 0} 
for G 3A . 

• Step 3 (Section 5): 

We show that for any g G G 3 n ? d3 (0, 1; #): 
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g G pFix(0{0, 1}*#) iff gh = hg for all h G pFix({l, #}{0, 1}*#). 

• Step 4 (Section 6): 

Moreover, pFix({l, #}{0, 1}*#) is finitely generated (and in fact finitely presented). The above 
commutation relation only needs to be checked between g and the finitely many generators of 
pFix({l, #}{0, 1}*#). The group G™° d3 (0, 1; #) is also finitely presented. 

As a consequence of steps 3 and 4, we have reduced the circuit equivalence problem to the 
word problem of G 3 n ° d3 (Q, 1; jf) (and hence also of G 3j i), via a polynomial-time constant-arity 
conjunctive reduction (the arity being the number of generators of pFix({l, #}{0, 1}*#)). The 
generating set of G™° d3 (0, 1; if) used for these problems is still the infinite set AU{r iji+ i : i > 0}, 
where A is any finite generating set of G™° d3 (0, 1; jf). 

• Step 5 (Section 7): 

We show that conjugation by (i — 0, 1, 2, 3) is an automorphism of G™° d3 (0, 1; #). (It is for 
this property that we needed the length-preservation mod 3 in the elements of G™° d3 (0, 1; #).) 
Hence, the following HNN extension yields a group /f(0, 1; which contains G™° d3 (0, 1;#) 
and k^k^i- 

#(0,1; #) = (G^ d3 (0,l;#)U{t} : {t ^t" 1 = <7"^ : <? G G^ d3 (0, 1; #)}). 
Since in step 4 we saw that G™° d3 (0, 1; #) is finitely presented, #(0, 1; #) is finitely presented. 
The transpositions r iyi+ i have linear word-length over the finite generating set of H(0, 1;#). 
Hence, the circuit equivalence problem reduces (via polynomial-time constant-arity conjunc- 
tive reduction) to the word problem of the finitely presented group #(0,1; #) (over its finite 
generating set). 

The group #(0, 1; #) is isomorphic to the subgroup (G™° d3 (0, 1; #) U {k 3 k 2 Ki}) of Gz,u 
and also to the semidirect product G™° d3 (0, 1; #) x Z. 

• Step 6 (Section 8): 

We prove that the word problems of #(0, 1; jf) and, more generally, of (G 3i i U {k , «i, k 2 }) are 
in coNP. As a consequence, the finitely presented group #(0, 1; #) has a coNP-complete word 
problem (relative to polynomial-time constant-arity conjunctive reduction); this is the group G 
of Theorem 11.11 

By results of Thompson and Scott, the commutator subgroup (£3,1 U {k , Ki, k 2 })' is a 
simple group. We prove that (G 3t iU{K , k 1 , k 2 })' has finite index in (G 3) iU{k , «i, ^2}}- Hence, 
(G 3t i U {/Co> K i> ^2})' is a finitely generated simple group with coNP-complete word problem; 
this is the group S of Theorem 11.21 Moreover, if (G 3j i U {ko, Ki, k 2 }) is finitely presented (as 
we conjecture), (G^i U {ko, Ki, ^2})' will also be finitely presented. 

• Appendix (Section 9): 

The first subsection of the Appendix contains properties of prefix codes, used in the paper. 

Another subsection of the Appendix shows that Theorems 11.11 and 11.21 and the Overview 
above also hold with G™° d3 (0, 1; #) replaced by another subgroup of G 3t \, namely by 

G™ d3 (0, 1) = {<p G G 3>1 : <p and <p~ l map {0, 1}* to {0, 1}* and 

for all x G {0, 1}*, |</>(s)| = \x\ mod 3 when 4>(x) is defined}. 

The proofs for Gf° d3 (0, 1) are similar to (but somewhat more complicated than) the proofs for 
G m ° d3 (0, 1; #), and appear in the Appendix. 

A special property is shown: Gf° d3 (0, 1) is the largest subgroup of G 3j i closed under con- 
jugation by k 3 k 2 k\. 
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2 Circuits and permutations of boolean variables 



Acyclic boolean circuits are a fundamental model of computation [33J, [32], [2E]. The equivalence 
problem for acyclic boolean circuits, mentioned above, is a well-known example of a coNP- 
complete problem. 

Circuits are traditionally built from the boolean functions AND, OR, and NOT, with domains 
{0, l} 2 or {0,1}, and image {0,1}. Moreover, circuits use the fork (or "fan-out", or "dupli- 
cation") function FORK: x G {0,1} i— > (x,x) G {0, l} 2 . The use of FORK is usually tacit; 
in a circuit diagram, FORK appears whenever a wire fans out (or forks, or splits) to become 
two wires that carry the same boolean value. One can view an acyclic boolean circuit as a 
composition of several copies of the functions and, or, not, and FORK. Since and, or, and 
fork are multi-variable functions, composition is complicated and requires a circuit diagram 
(which is essentially an acyclic graph) to describe how the operations are connected. We will 
use AND and OR gates with fan-in 2 only. 

We will see that a circuit can be represented by ordinary composition of functions, thanks to 
Thompson groups. We have seen that these groups can be described as partial action groups, 
acting on strings. We will use this partial action to simulate circuits. 

The functions AND, OR, not, and FORK that make up acyclic circuits use one or two 
boolean variables (that range over the set of boolean values {0,1}). An acyclic circuit has 
boolean variables (x , xi, . . . , x m -i) as input (ranging over all of {0, l} m ), and boolean variables 
(yo,yi, ■ ■ ■ ,y n -i) as output (ranging over a subset of {0, l} n ); the circuit computes a function 
/ : {0, l} m — > {0, l} n . We will extend the function / to the partial function 

XoXi . . . x m -iw G {0, 1}* i — > f(xo, Xx, . . . , x m -i) w G {0, 1}* 
for any w G {0, 1}*. We let boolean functions operate on an arbitrary (large enough) number 
of variables, rather than a fixed number. 

In this paper, we always write functions on the left of their argument. Also, we make the 
following convention: Let : A* — > A* be a partial map and x G A*; when we write <f>(x) it is 
to be understood that <f>(x) is defined (i.e., x G Dom(0)). 

Since we write the variables xq,xi, . . . ,x m -i, ... in a fixed order we need to introduce 
maps that permute these variables. In circuit drawings this corresponds to crossing of wires. In 
particular, we use the transposition of variables x i? Xj (with < i < j), defined by uxiVXjW G 
{0, 1}* i — ► uXjVXiW G {0, 1}*, where \u\ =i,\v\ — j — i — 1, w G {0, 1}*. 

The finite symmetric groups are generated by two elements, a transposition and a cyclic 
permutation. Here we also want to obtain a finite number of generators, but since we deal now 
with unbounded finite bit-strings, we need to consider new versions of the cyclic permutation. 
This, in turn, requires the introduction of a new letter into the alphabet; the new letter, 
denoted will act as a "boundary marker" for the cyclic permutations. A first idea of an 
unbounded cyclic permutation would be to take XoXi . . . x m -\j^vo i — > X\ . . . x m -iXo#w, for 
all Xo, xi, . . . , x m -\ G {0, 1}, and w G {0, 1, #}*; but it turns out that this definition does not 
lead to good properties (some Thompson groups that we will work with are not closed under 
conjugation by this permutation). So we will use the following permutations of N, written as 
infinite products of disjoint cyclic permutations. Recall that a cycle (i\j\k) (for three distinct 
elements i,j, k G N), denotes the permutation i i — > j i — ^ A; i — > and x i— >■ x for x G" {i,j, k}. We 
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denote the group of all permutations of N by &n- Again, recall that we write maps to the left 
of the argument. 



70 = (3n | 3n + 1 | 3n + 2) ... (3 | 4 | 5) (0 | 1 | 2), 

71 = (3n + 1 | 3n + 2 | 3(n + 1)) ... (4 | 5 | 6) (1 | 2 | 3) (0), 

72 = (3n + 2 | 3(n + 1) | 3(n + 1) + 1) ... (5 | 6 | 7) (2 | 3 | 4) (1) (0), 

7s = (3n | 3n + 1 | 3n + 2) ... (3 | 4 | 5) (2) (1) (0). 

Based on these permutations of N we define the following elements Kq, «i, k 2 , k 3 G C/ 3) i. 

The effect of (z = 0,1,2,3) on a string x^x\ . . .x m j^w (with x , £i, . . . , x m G {0,1}, 
w G {0, 1, #}*) is to permute the bits x Xi . . . x m according to 7,; the bit Xk at position k 
(0 < < m) is moved to position 7i(fc). Thus, Kj(x Xi . . .x m #w) = yoVi ■ ■ .y m #w, where 
y-yi(k) = Xk- Equivalently, yj = for < j < m. According to this definition, ^ is well 

defined on a string xqX\ . . . x m #w when m = % mod 3. To make Ki well defined on all strings 
in {0, 1, #}* we let Ki act as the identity on the one or two right-most "extra bits", when m is 
not = i mod 3. The detailed definition of hZi IS clS follows: 

• For x = Xq . . . Xi . . . x 3n+ 2 r where n G N, X{ G {0, 1} (0 < % < 3n + 2), and r G {0, l}- 2 , 
we define 

«0(Z) = ^ 7 -l(0) • • • ^(i) • • • a; 7o - 1 (3n+2) r #. and 
«3(») = ^(O) • • • • • • X 7 3 ~ 1 (3n+2) r #■ 

• Similarly, for x = x . . . Xi . . . x 3 ( n +i) r # we define 

= a; 7 -i( ) • • • • • • ^-^(n+i)) r #■ 

• For a; = x . . . . . . x 3(n+ i) + i r # we define 

k 2 (x) = x 7 -i (0) . . . ar 7 -i(i) • • • ^-^(n+iHi) r #• 

We will abbreviate k 3 k 2 ki(-) to k 32 i(-)- The element k 32 i G £/ 3) i will play an important role 
in this paper. 

The introduction of the new letter # in the boolean alphabet {0, 1} forces us to rethink 
the correspondence between the Thompson groups. We will now use the Thompson-Higman 
group G 3j i of [T3j, acting on {0, 1, #}*. The Thompson-Higman group G 3j i is isomorphic to a 
subgroup of the Thompson group V. 

As a Thompson group element, the transposition ry G G 3i i of Xi,Xj (0 < % < j) is 
defined as follows. The domain and image prefix code of ry is the finite maximal prefix code 

domC(ry) = imC(ry) = {0,lp' +1 U {0,lp#. 

On an argument in {0, 1} J+1 (i.e., the number of "boolean variables" in the argument is at least 
j + 1) we define 

where x { ,Xj G {0, 1}, u G {0, 1}\ and v G {0, lp'"*" 1 . 
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We also need to consider the case of an argument of the form z# where z = xqX\ . . . xt-\ G 
{0, 1} £ with £ < j. Here, the number of boolean variables in the argument is strictly less than 
j + in other words, the argument is "too short" for the transposition Tij. For those arguments 
we define Tij in such a way that 

• Tij is be a permutation of the boolean variables x , Xi, . . ., a^-i; 

• when£ = 0, riJ (#) = #. 

• when < i < j, Tij fixes Xo, i.e., Tjj maps the set 0{0, 1}* U 0{0, 1}*# into itself, and it 
maps 1{0, 1}* U 1{0, 1}*# into itself. 

The actual details of the definition when the argument is too short are a matter of convenience, 
and will be given later. However, we will completely define ro,i here, by letting it act as the 
identity map on {0, l}- 1 ^; and of course, xqx± h- > x±xq for all xq,xi G {0, 1}. Similarly, we 
completely define by letting it act as the identity map on {0, l}- 2 #; and x§X\Xi \— > x§xiX\ 
for all xq,X\,x-i G {0, 1}. For all i,j we define Ty to mean the same thing as r^j. 

The classical formulas about transpositions are still true for this definition of transpositions. 
For all k > 0, and for all x G {0, 1}*: 

Ti,j(x) = T i)k T k j(x), if \x\ > max{i, j, k) 

T id (x) = T M+ i T i+ i ii+2 ■ ■ ■ Tj-2,3-1 T j-2,j-l ■ ■ ■ n+l,i+2 Ti,i+l( X ) wnen < * < h 

and \x\ > j. 

For an argument x G {0, 1}*# that is "too short", we will simply define r it j by the second 
of the above formulas. Recall that initially we picked Tij(x#) to be arbitrary (subject to the 
requirement that Tjj should be a permutation of its domain code, and that Tjj should fix the 
left-most boolean variable when < i). Now rj j j + i(x#) is still arbitrary (for all < i, and 
x G {0, but all other T i: j(x#) (when \j — i\ > 1) are now defined in terms of the 

r M+ i(a;#). 

The classical formulas about transpositions, are now true on a maximal prefix code (see the 
Lemma below). For the first formula, the maximal prefix code is {0, lj m+1 u {0, l}- m #, where 
m = max{i, j, k}, and for the second formula the maximal prefix code is {0, U {0, 

Definition and notation. For a group G, a subset A C G, and an element g G (A)g, 
we define the word length of g over A to be the length of the shortest word over A ±:L that is 
equivalent to g in G. We denote the word length by \g\&- 

In summary, we proved: 

Lemma 2.1 As elements of the Thompson-Higman group G 3i i the transpositions satisfy the 
following equalities for all k > 0: 

n,j = n,i+l T i+l,i+2 ■ ■ ■ r 3-2,j-\ Tj-lj Tj- 2 ,j-l ■ ■ ■ Ti+l,i+2 (when < % < j). 

So the word length of T it j (0 < i < j) over the alphabet {r kjk+1 : < k} is < 2{j — i) — 1. 
We also have: 

Lemma 2.2 Let n > and x G {0, 1}*. 

T3n+I,3n+2(Z#) = ^21 r l,2 ^2l{ x if) , «/ \A > 3(n + 1); 
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T3n+2,3(n+l)(>#) = /% 2 " ^ 7i, 2 «l «32l(&#) 

= «321 r l,3 ^321 { X #)> if \ X \ > 3 (™ + !) + 1/ 

T3(n+l),3(n+l)+l(z#) = «321 ^l^ ^ r l,2 «2 «1 «32l(a#) 

= «M1 7-3,6 ^321 (*#), */ M > 3(n + 1) + 2. 

Every transposition Ti-ij (i > 0) has word length < 2i over {to,i, ti >2 , «i, « 2 , ^3}? and has word 
length < [~] over {r ,i, t 1j2 , r 3 ,6, «32i}- 

Proof. On an input x# as above, we can verify that 

X# = X X 4 X 5 X 6 X 7 X 8 X 9 . . . X 3k+1 X 3k+2 X 3 ( k+1) ... # I — ► 

xq X4X5X1 X7X8X2 • • • a;3(fc+i)+i a; 3(fc+i)+2a ; 3(fc-i) ■ ■ ■ ft 
For any n>0we can then verify the first formula: 

X# = Xq X 1 X 2 X 3 X i . . . X 3n+1 X 3n+2 . . . # ^> 

„ n,2 

^0 ^3n+i^3n+2^? • • • 2;?^? • • • W 1 — ► 
X X 3n+2 X 3n+ iX? . . . x?x? . . . # 

Xo X\X 2 X 3 . . . X 3n+2 X 3n+ i • • • 

Note that ^2, K i, an d r i,2 do not change xq. For the other two formulas the proof is very 
similar. 

For arguments xj^ that are "too short" we will define T iii+1 (x#) by the above formulas 
(when 1 < i). □ 

Remark on the definition of the transpositions: We defined T\ j2 and 7o,i earlier, and we 
gave formulas that define any ry in terms of transpositions of the form T ntTl+ i {n > 0). So, 
since the above Lemma defines 7^+1 (2;$) when is "too short", all transpositions are now 
completely defined as elements of G 3 ^. 

Remark on the role of the transpositions: The transpositions are elements of G^i, and 
G 3i i is finitely generated; let A 31 be a finite generating set for G 3) i. So we can write each r iji+1 
as a finite word over A^J. Why do want to use a generator like k 32 i which doesn't belong to 
(j3,i? The reason is complexity: Over A 3) i U {^321}, the word length of 7^+1 has a linear upper 
bound, but over A3 ; i alone, the word length of T^i+i has a lower bound which is exponential in 
i (as we will prove in Lemma f8. 61 and Theorem 18. 7|) . 



3 Simulation of a boolean function by a group element 

One problem in trying to simulate circuits by group elements is that the input-output function 
of a circuit is not necessarily a permutation. Obtaining permutations is a slightly stronger 
requirement than the classical problem of constructing injective (a.k.a. "reversible") circuits. 
See e.g. ^5], P] for the construction of injective Turing machines, and ^T] for injective 
circuits; the latter reference contains insightful comments on the physical significance of injective 
computing. 
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To do injective computing with non-injective functions, we apply the following transforma- 

tion from functions to permutations. For a function A i — > B, let Tf = {(x, f(x)) : x G A} be 
the graph of the function. Consider the transformation n defined by 

f n\ (A , , T, *(/) 



IT 



(A — ^ B) 



x G A. Note that 



— (Aur, ^ Aur f ), 

where n(f) is defined by x G A i — > (x, f(x)) G Tj, and (x, f(x)) eT/i 
7r(/) is a permutation of the set AUFf, for any function /. 

In programming, functions / are often tacitly replaced by 7r(/) because when an output is 
computed, people also want to remember the input. Note also that for two functions f\ and f 2 
with same domain set A and same image set B, we have f\ = f 2 iff 7r(/i) = 7r(/ 2 ). 

In this section we first associate elements of the Thompson-Higman group G 3jl with the 
elementary circuit components not, OR, and, and FORK. We base this on the above transfor- 
mation 7r. Then we define "simulation" of an acyclic circuit by an element of G^±; an element 
of G 3) i is described by a sequence of generators. Finally we prove that every acyclic circuit 
can be simulated by an element of G^i; moreover, this simulation provides a polynomial-time 
reduction of the equivalence problem of circuits to the equality problem of elements of G 3tl , 
restricted to the subset 0{0, 1, #}* of {0, 1, (the word problem with restriction) . In the next 
section we will go further and we reduce the word problem with restriction to the actual word 
problem. 

With the boolean functions not, or, and and, we associate the following elements of G 31 
(described by tables). 



^ = 



1 # 

1 # 



0X\X2 



identity 



on 



(x 1 V x 2 )x 1 x 2 (x l V x 2 )x l x 2 {0, l}- 2 # 



OX1X2 lXiX 2 

(xiAx 2 )xiX 2 (xi A x 2 )xix 2 



identity 

on 
{0,1}^ 2 # 



where Xi,x 2 range over {0, 1}. Hence the domain and image codes of ip v and ip A are all equal 
to {0,1} 3 U {0,lp#. 

The three functions above are length-preserving: |</? A (x)| = |x| for all x G Dom(</? A ), and 
similarly for </? v and </?_,. 

In order to represent the FORK function in circuits by an element of G 3i i a first idea would be 
to define a "0-fork" element of G 3i i (which duplicates a leading 0), as follows: 

r # 10 i# 11 
^ of [00 o# 01 # 1 

Then, r 0j i v? v ^(Ox) = Oxx (for all x G {0, 1}), so we could use this as a way to represent 
the FORK operation in a circuit. 
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However, it will turn out later that what we need is a forking operation that preserves the 
string length modulo 3. Thus, we define a "four-fold 0-fork" element of G 3; i (which turns a 
leading into four leading Os). 

To # 10 i# i 2 o i 2 # i 3 o i 3 # i 4 " 

m4 ~ [ 4 3 # 01 0# 2 1 2 # 3 1 # 1 

We have domC(v? f,4) = l- 3 {0, #} U {l 4 }, and imC(y9 f,4) = 0^ 3 {1, #} U {0 4 }. From the 
definitions one immediately verifies the following. 

Lemma 3.1 The maps Tij (where < % < j), y?of,4; ¥v> V^a belong to the Thompson- 
Higman group Gz,i, they stabilize the sets {0, 1}* and {0, 1}*#, they preserve lengths modulo 3, 
they map {0, 1}* into itself, and they map {0, 1}*# mto ifeeZ/. 

Notation: Let G C C/ 3jl ; note that "C" means that G isn't just a subgroup, but a particular 
embedding into Qz,i is considered. By C mod3 we denote the subgroup 

{<pEG : VxG{0,l}*, \<p(x)\ = \x\ mod 3}, 

i.e., the elements of G that, when restricted to {0, 1}*, preserve the length of strings modulo 3. 
In particular, we will use the notation G™ d3 for the corresponding subgroup of the Thompson- 
Higman group G 3i i. 

We point out that </?_,, </? v , y?A) an d all Tjj (0 < « < j) are length-preserving, and that </? f,4 
preserves length modulo 3. We will not use any other elements of G 3; i in the constructions and 
proofs in this Section. 

In order to obtain computational results we describe boolean functions by acyclic circuits, 
and we describe elements of G 3j i by words. Let us choose a finite set of generators A 31 of the 
group G 3>1 . For G 31 we also use the infinite generating set A 31 U {r iti+ i : < i}. 

Let C be an acyclic boolean circuit with m input variables xi, . . . ,x m and n output variables 
y±, . . . ,y n . Let f c ■ {0, l} m — > {0, l} n be the input-output function of C. Hence, two circuits 
C\ and C*2 are equivalent iff fc 1 = fc 2 - 

Our definition of "simulation" is a variation of the above transformation n. 

Definition 3.2 An element $ f e Gff 3 simulates a boolean function f : {0, l} m -> {0, l} n 

iff 

• £ne domain code and the image code of $j are subsets of {0, 1} {0, 1}* U {0, 1}* # 

• $/ maps 0{0, l} m mto 1+i(n) {0, l} n+m m sncn a way that 

QfiOx! ... x m ) = 1+i W /(ii,...,i m ) x x ■■■ x m 
where i(n) G {0, 1,2} is such that 1 + n + «(n) is a multiple of 3 (i.e., i(n) = —(1 + n) mod 
3j; so the role of i(n) is to make preserve lengths modulo 3; 

• <$>f and Qj 1 map the set {0, 1}* into itself, and map {0, 1}*# into itself; moreover, $j maps 
the set 0{0, 1}* into itself, and Qj 1 maps the set 1{0, 1}* into itself. 

When $/ is represented by a word Wf over A^} U {r^+i : < «} we say that Wf simulates f . 
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A boolean function / can be simulated by many elements of £3,1. 

By the above definition, if we {0, l}- m U {0, l}- m # then $/(0u>) tells us the value of / on 
input x\ . . . x m (where x\ . . . x m is the prefix of length m of w). The definition does not give any 
connection between $/(0xi . . . Xk #) and / when k < m (where x±, . . . , Xk G {0, 1}); we call 
this the "case when the input is too short". In some applications we want such a connection, 
hence we will need the definition of "strong simulation" below. (We cannot do much about 
the fact that <&f(lw) has no connection with /; since is an element of G^i, it is a bijection 
between maximal prefix codes, whereas / need not be injective nor surjective. So there has to 
be a big difference between and / somewhere.) 

Definition 3.3 We say that strongly simulates / iff in addition to the conditions of 
simulation (Definition \H. tyl . we have for all < k < m: <&f(0xi . .. is defined for all 

Xl ...x k e {0,i} fc . 

So for strong simulation, $/(0 x\ ... Xkjf) depends only on the function f and on k and on 
x\ . . . Xk', it does not depend on any particular circuit used to compute f . 

The next Lemma follows immediately from the definition of simulation. It gives a connection 
between the equivalence problem of circuits and the word problem with restriction of £3,1. For 
a Thompson group G (c Qz,i) with generating set A, and a subset S C {0, 1,#}*, the word 
problem with restriction is defined as follows: 
INPUT: Two words u, v over 

Question: Are the partial functions described by u and v the same when restricted to SI 

We denote the restriction of a partial function F to a set S by F\$- The next Lemma follows 
immediately from Definitions IH.2I and EH 

Lemma 3.4 Let f and g be any boolean functions with the same number of input variables 
and the same number of output variables. If f and g are simulated by $/ ; respectively $ 9; then 
we have 

f = g iff ($/)| {o,i,#}* = ($ fl )|o{o,i,#}' 
In the case of strong simulation we have, in addition, 

f = g iff ($/)|{o,#HO,i.#}* = ($<;)l{o,#}{o,i,#}* 

Let A 31 be a finite set of generators of the group G^i . For G 31 we also use the infinite 
generating set A 31 U {r iji+ i : < i}. With every acyclic boolean circuit C we want to associate 
a word Wc over the alphabet A^} U {r i i+1 : < i}, and we want the correspondence C t— > wc 
to be polynomial-time computable. For every word w over A^} U {jij : < i < j} we denote 
the length of w by |iw|, and we denote the largest subscript in any r^j occurring in w by J w . 

The size of an acyclic boolean circuit C is denoted by \C\; if C has k± gates of type not 
or FORK, k 2 gates of type and or OR, and n output variables, the size of C is defined to be 
\C\ = k\ + 2 • &2 + n. Equivalently, \C\ is the number of connections (wires, or edges in the 
circuit graph) between gates or from an input/output port to a gate (for that reason, gates 
with two input variables are counted twice). (Our definition of the size \C\ is slightly different 
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from the traditional definition, which just counts not, and, or gates and I/O ports, but it is 
linearly related to the traditional definition.) 

In an acyclic circuit every gate, and also every input or output variable, can be assigned 
a level (or "layer", or "depth"). The input variables of the circuit have level 0. A gate or an 
output variable has level 1 iff only input variables of the circuit feed into it. A gate or an output 
variable has level £ iff it receives input from levels < £ only, and at least one of its inputs comes 
from level £ — 1. The maximum level of any output variable is called the depth of the circuit. 

Theorem 3.5 There is an injective function C \— > wc from the set of acyclic boolean circuits 
to the set of words over the alphabet A^J U {t^+i : < i} (where A 3j i is a finite generating set 
ofG^i), with the following properties: 

(1) w c strongly simulates fc- 

(2) The length of uic satisfies \wc\ < C |C| 4 + c (for some positive constant c), 
and the largest subscript J wc of any T i>i+1 in wc satisfies J wc < c \ C\ 2 + c. 

(3) wc is computable from C in polynomial time, as a polynomial in \C\. 

To make sense of the phrase u wc is computable . . ." , we need to represent any transposition 
Tj ii+ i (with i G N) by a string over a finite alphabet; we simply write the integer i in unary 
notation (i.e., % is represented by the string 0*). 

Proof. We assume that the elements </?_,, </? v , <£a, ty?of,4 an d t~o,i belong to A 3jl . If this were 
not the case, we could express these by fixed words over another finite generating set of G^±. 

We can assume that our acyclic circuits are strictly layered, i.e., a gate or an output variable 
at level £ only receives inputs from level £—1. Hence, all the output variables of the circuit are 
at the same level L (L is the depth of the circuit). If the layering of a circuit C is not strict, 
we can insert identity gates to enforce strictness. An identity gate has one input variable and 
one output variable, connected by a wire; the two variables carry the same boolean value. In 
the present proof we will count these identity gates as gates in the definition of circuit size. In 
order to make a circuit C strictly layered, fewer than \C\ 2 identity gates need to be introduced. 
(Indeed, for each gate g we add at most as many identity gates as the depth of this gate g; 
so, in total we add at most |C| • depth(C) (< \C\ 2 ) identity gates). So the size increase 
is polynomially bounded. Moreover, identity gates will not affect wc, as we will see in the 
construction of Wc- 

A circuit C has input variables xi, . . . , x m , output variables y±, . . . , y n , and internal variables 
which correspond to the boolean values carried by internal wires (between gates or between a 
gate and an input or an output port). The internal variables at level £ are denoted y\, y^, ■ ■ ., 
y^ . When £ = L (output level) we have ul = n and yf = yf, and when £ = (input level) we 
have n = m and y® = x^. For every level £ (1 < £ < L), we consider a circuit Ci (called the 
slice ofC at level £). The input variables of Cg are . . ., yi~^, and the output variables are 
y[, . . ., y^ • the gates of C e are all the gates of C at level I. 

It will be convenient to use the notation Y e = y\y l 2 ■ ■ ■ y l ni (concatenation of all the variables 
yf), for <i < L. 

In order to define Wc let us first consider the case when L — 1, i.e, the circuit consists of 
just one slice. 
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Let k > and assume that for every circuit C of depth 1 and of size \C\ < k (where identity 
gates are counted as well), we can compute a word wc (over the alphabet Af\ U{rj i+1 : < i}). 

Any circuit C of depth 1 and of size k + 1 can be viewed as a circuit K of depth 1 and of 
size < k, with an additional gate (and, or, not, identity, or fork). Let x±, . . . ,x m be the 
input variables and let yx, . . . , y n be the output variables of K. 

Case 1: Suppose our circuit C is obtained from K by adding an identity gate or a not 
gate, with new input variable x m+1 and new output variable y n+ x- Note that only one wire 
can be connected to an input variable xf, we use explicit FORK operations when we want to 
duplicate a variable. In case a not gate is added, the input-output function of the new circuit 
is fc(x\, . . .,x m ,x m+1 ) = (yx, . . .,y n ,x^t), where f K {xx, ■ ■ ■ ,x m ) = [yx, . . .,y n ). The boolean 
function f c is to be simulated by a Thompson group element : {0, 1}* — > {0, 1}* such that 

^ f (0x 1 ...x m ,x m+1 ) = 1+l(n+1) y 1 . . . y n x~^{ x 1 ...x m x m+1 

for all xi, . . . , x m , x m+ i G {0, 1}, and such that $/ has the stability properties of Definition 13.21 
recall (as we saw in the Definition of "simulation") that i{n) = — (n + 1) mod 3, i(n) e {0, 1, 2}. 

Let wk and $/ A , G (j3 ; i be the simulation of fx, which exists by induction. We proceed as 
follows: 



0xi...x m x m+ i 3> o 1+ * (n) y x . . . y n x\ . . . x m x m+ i 
Case i(n) = 1: In this case we continue the simulation of fc as follows. 

yx y 2 ■ ■ ■ y n %i ■ ■ ■ x m x m+1 ' — > x m+1 yi . . . y n x x ■ ■ ■ x m yx i — > 

x m+ i x m+ i y 2 ■ ■ ■ y n xx ■ ■ ■ x m yx i — > x m+1 x m+1 yi . . . y n xx ■ ■ ■ x m yx i — > 

x^x~ yiy 2 ...y n xx...x m x m+l 
Applying r n+lin+2 r n , n+1 ... r 1>2 r ,i(-) then yields 

yx ■ ■ ■ yn X m +x Xx ■ ■ ■ X m X m +x- 
Thus our circuit C is simulated by the following word 

Wc = T n+ x,n+2 ■■■ Tx, 2 Tq,X V 9 - V^V T"2,n+m+2 W K - 

Case i(n) = 2: In this case we continue the simulation of fc as follows. 
yx . . ■ y n x x ■ ■ ■ x m x m+1 i — > x m+1 yx ■ ■ ■ y n xx ■ ■ ■ x m i — > 
^jn+i u x m +i yx ■ ■ ■ y n Xx ■ ■ ■ x m U I — > X m+ x u X m+ x Vl ■ ■ ■ Vn x x ■ ■ ■ x m U I — >■ 
Xm+x Oyx ■ ■ .y n x\ . . . x m x m +i 

Applying r n+ i jn+2 r ni „ + i ... n, 2 t ,i(-) then yields 

2/1 • • • Vn Xm+x Xx • • • X m X m+ i. 

Case i(n) = 0: In this case we continue the simulation of fc as follows. 

j/i . . . y n xx ■ ■ ■ x m x m+ i i — ► yx . . . y n x x ■ ■ ■ x m x m+1 i — > 

x m+ i yx . . . y n x x ■ ■ ■ x m ^ x m+1 x m+i y x . . . y n x x ■ ■ ■ x m 

x m+ i a; m+ i Qyx...y n xx...x m \J i — > 

x^pr yx . . . y n x x . . . x m x m+1 
Applying r n+2 ,„+ 3 r n+1)n+2 ... t 1>2 t 0) i(-) then yields 

00 j/ x . . . y n x^{ x x ■ ■ ■ x m x m+1 . 
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The case where, instead of a not gate, an identity gate is added is similar (except that we 
simply omit tp^). 

In any case the length of wc over the alphabet A 31 U {r^j+i : < i} is at most \wk\ + 
2 |r 2]n+m+4 | + 4 + n + 2. By Lemma I2.1| |r 2in+m+4 | < 2 (n + m + 2) — 1. Hence, < 
+ 4m + 5n + 12. Moreover, the subscripts of the transpositions appearing in wc are 
< max{n + m + 4, Jk}, where Jk is the largest subscript in any transposition appearing in %. 

In case we want to change the positions of the added variables x m+ i and y n+ \ (so that 
x m+ i is the ith input variable and y n +\ is the jth output variable), we apply other appropriate 
permutations (instead of r n+2 ,n+3 Tn+i,n+2 • • • 7"i,2 To,i and r 2in+m+ 4 above). This does not 
change our upper bound on \wc\- 

Case 2: Suppose our circuit C (still of depth 1) is obtained by adding an and gate or an 
OR gate to K, with new output variable y n +i and new input variables x m+ i, x m+ 2. Recall that 
only one wire can be connected to an input variable x\. We only deal with the OR case (the 
AND case being practically the same). The input-output function of the new circuit is 

fc(xi, ■ ■ ■ , X mi X m J r \ 1 XjnJf-2 ) = (yu-.. i Vm X m +i V X m j r 2 ) , 
where Jk(xi, ■ ■ ■ , x m ) = (yi, . . . , y n ). The boolean function fc is to be simulated by a Thompson 
group element $/ : {0, 1}* — > {0, 1}* such that 

$/(0 xx . . . x m x m+ ix m+ 2) = 1+i(n+1) yi...y n (x m+1 V x m+2 ) xi... x m x m+1 x m+ 2 
for all xi, . . . , x m , x m+ i, x m+ 2 € {0, 1}, and such that $/ has the stability properties of Definition 
IH.2I Let wk and $/ K G G^i be the simulation of J'k, which exists by induction. Then 

X\ . . . x m x m j r \X m j r 2 1 * y\ . . . y n X\ . . . x m x m j r \X m j r 2 
Case i(n) = 1: The simulation continues as follows. 

00 yi y 2 ■ ■ ■ y n %x ■ ■ ■ x m x m+1 x m+2 i — > i — > x m+ ix m+2 y 2 ■ ■ ■ y n %i ■ ■ ■ x m y x \ — > 

, v „ Ti,n+m + 2 T 2 , n + m+ 3 

(x m+ i V x m+2 ) x m+ iX m+2 y%...y n x\ . . . x m y x i — > i — > 
(x m+ i V x m+2 ) y x y 2 ■ ■ .y n x x . . .x m x m+1 x m+ 2 . 
By applying T n+ i, n+ 2 . . . r 1>2 r ,i we obtain 

2/l2/2 • • ■ 2/n (#m+l V X m+2 ) Xi . . . X m X m+ iX m+ 2- 

Case z(n) = 2: The simulation continues as follows. 

„„„ r l,n + m+3 T2,n + m+4 „ „ „ i^V 

000 yi y 2 • • • y n %i ■ ■ ■ x m x m+ ix m+2 i — ► i — > x m+1 x m+2 y x . . . y n x x . . . x m i — > 

( , . \ „ „ Tl. n + m+3 T2,n + m+4 

[X m+ i V X m+ 2) X m+ iX m+ 2 yi-..y n X! . . . X m I — ► I — > 

^m+2) 00 y\y2 ■ ■ ■ y n %i ■ ■ ■ x m x m+ ix m+ 2 ■ 
By applying r n+2) „ +3 . . . r 1>2 r ,i we obtain 

y\y2 ■ ■ ■ y n [%m+i V x m+2 ) xi . . . x m x m+ ix m+2 . 
Case z(n) = 0: The simulation continues as follows. 

„ ¥>0f,4 

yi y% . . . y n xi . . . x m x m+ ix m+2 1 — > 

Tl,n + m+4 T2,n+m+5 



0000 yiy 2 . . .y n xi. . .x m x m+1 x m+2 
x m+ iX m+2 7/1 . . . y n x x . . . x m 
l^m+i v x m+2 j x m+ ix m+2 u?/i . . .y n x\ . . . x m u u 1 > 1 > 
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(x m+1 V x rn+2 ) 000 yi . . . y n x x . . . x m x m+1 x m+2 
By applying r n+3in+4 . . . r 1;2 r ,i we obtain 

00 y x y 2 ...y n (x m+1 V x m+2 ) x x ...x m x m+1 x m+2 . 

Thus our circuit C is simulated by the word wc of length < \wk\ + 8m + 9n + 15 over the 
alphabet A 3 \ U {r i i+ i : < 2}. Moreover, the subscripts of the transpositions appearing in wq 
are < max{n + m + 5, Jif }, where Jk is the largest subscript in any transposition appearing 
in %. 

In case we want to change the positions of the added variables x m+ i, x m+2 and y n +i (so 
that x m+ i is the iith input variable, x m+2 is the i 2 th input variable, and y n +\ is the jth output 
variable), we apply other appropriate permutations (instead of T n+3jn+ 4 . . . r 12 r 01 , T 2jn+m+5 , 
and T l n+m+4 ). This will not change our upper bounds on \wc\ and Jc- 

Case 3: Suppose our circuit C (still of depth 1) is obtained by adding a FORK gate with 
a new input variable x m+ i and two new output variables y n +i and y n + 2 . The input-output 
function of the new circuit is 

fc( ) = (yi,... 

where fic(xi, ■ ■ ■ , x m ) = (yx, . . . , y n ). The boolean function fc is to be simulated by a Thompson 
group element such that 

$/(0xi . . .x m x m+1 ) = 1+j( " +2) y x . . . y n x m+1 x m+1 x x . . . x m x m+1 

for all xi, . . . ,x m ,x m+ i G {0, 1}, i{n + 2) = — n mod 3, and such that $/ has the stability 
properties of Definition 13.21 Let wk and §f K G G 3> i be the simulation of fx, which exists by 
induction. Then 

X\ . . . x m x m +\ i > y\ . . . y n X\ . . . x m x m j r \ 

Case i(n) = 2: We continue the simulation with 

000 yi...y n xx... x m x m+1 i — ► 00 x m+1 y x . . . y n %i ■ ■ ■ x m i — ► 

n n T0 ' 1 n n Vy 

x m +i u x m +i y x . . .y n X\ . . . x m u i — > u x m +ix m +i y\ . . . y n x\ . . . x m u i — ► 

Xm+lXm+lXm+1 2/l ■ ■ ■ 2/n 3^1 • • • X m U I > X m -^xXm-\-l U |/i . . . y n X\ . . . X m X m -^\ 

Finally we apply r n+lin+2 . . . r 3j4 r 1;2 and T n>n+1 . . . r 2j3 r ,i to obtain 

2/l2/2 ■ ■ ■ yn X m +iX m +i X\ . . . X m X m +\. 

Case i(n) = 0: We continue the simulation with 

yi . . . y n xi . . . x m x m+ i i — > 0000 yi . . .y n xi . . .x m x m+ i \ — > 

00 x m+1 y x . . . y n x% . . . x m ^ x m+ i x m+1 yi . . . y n X\ . . . x m ^> 

n n n </3v n n T2 .' i + m + 4 

u ^m+i^m+i u yi . . . y n x\ . . . x m u i — > x m +ix m +ix m +i u y\ . . . y n x\ . . . x m u i — > 

x m+i x ~m+i 00 y± . . . y n Xi . . . X m X m+ \ 

Applying r n+2 ,n+3 • • • r 3)4 n t2 and r n+hn+2 . . . r 2i3 r 0) i we obtain 

y±y 2 ■ ■ ■ y n x m+ ix m+ i x\ . . . x m x m +\. 

Case i(n) = 1: We continue the simulation with 

nn *Pot,4 nnnnn T2,„ +m+5 

00 y x . . . y n xi . . . x m x m+1 i — > 00000 y x . . .y n x x . . .x m x m+1 i — > 
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00 x m+ i 00 y 1 . . . y n xi . . . x m A x m+1 x m+1 00 yi . . . y n x ± . . . x m ^ 

x m+1 x m+1 00 j/x . . . y n X! . . . x m ^ ^m+i^m+i^m+i 00 yi . . . y n x ± . . . x m r2 '^ +B 

%m+l%m+l 000 y\ . . . y n X\ . . . X m X m+ i 

Applying r„ +3in+4 . . . r 3i4 n >2 and r„ +2 , n+3 • • • ^2,3 7"o,i we obtain 
00 y x y 2 ...y n x m +ix m +i x 1 . . . x m x m+1 . 

The above gives us a word w c of length \w c \ < \w K \ + 4m + 6n + 20 over the alphabet 
^3^1 U { r i,i+i : < «}, simulating / c . Moreover, the subscripts of the transpositions appearing 
in wc are < max{n + m + 5, Jr-}, where Jr- is the largest subscript in any transposition 
appearing in wk- 

In case we want to change the positions of the added variables x m+ i, y n +i, and y n +2 (so 
that x m+ i is the ith input variable, y n+ \ is the jith output variable, and y n +2 is the j 2 th 
output variable), we apply appropriate other permutations (instead of r n+3jn+4 . . . r 34 r 12 , 
Tn+2,n+3 • • • T2,3 To,i, and T2, n +m+5)- This does not change our upper bounds on \wc\ and Jo- 
in each of the three cases, the circuit C of depth 1 is simulated by a word wc over the 
alphabet A^} U {t^+i : < i}, of length \wq\ < 9 |if | + 20 + |u>x|- After < |C| construction 
steps (starting with K being the empty circuit, and ending with K being C), the length of wc 
will be \w c \ < § |C| 2 + 25|C|. The transpositions occurring in wc have maximum subscript 
< \C\ + 5. The above construction of each word wq from C is a polynomial-time algorithm. 

Inductive step: Assume that C has depth L > 1. In order to define wc we can use the 
fact that we have already defined the words wc e (1 < t < L) for the slices Ce of C. Indeed, each 
slice has depth 1, so the base of the induction applies. Each word wc e has all the properties 
claimed in the Theorem for circuit Q. In particular, wc e defines the map 

$ c< : Y^ 1 1 — ► 0^ Y e Y £ -\ 

Hence, since §c e is a right ideal isomorphism, we also have 

Y l ~ x O*^- 1 ) Y e ~ 2 0^-2) Y 1 O^™ 1 ^ Xi .x >— ^ 

Q^e) Y £ Y l ~ l O^"* -1 ) Y l ~ 2 i ( n t-'^ Y 1 0^ ni ^ Xi . . .x m 

Applying (<7i,nJ 1 ^ to this word yields 

Y e 0^™^ Y e ~~ l 0^" £ -^ Y e ~ 2 i ( ne - 2 ') Y 1 O^™ 1 ^ Xi . . .x m . 

where, in general, a it j denotes the permutation Tj-ijTj-2,j-i ■■■ T i+i,i+2 T i,i+i (•) (for all 
< i < 3). Therefore, 

{ai,n L ) i{ - nL) WC L W* 1 - 1 ' WC^ ... (<Jl,n e ) iM W Ce ••• famY™ W Cl 

defines the map 
0x 1 . . . x m 1 — > 

0yi...y n i{n) Y L - 1 O*^- 1 ) . . . Y l 0*^ . . . F 2 0^ F 1 O^™ 1 ) xi . . . x m (= def Z). 
Note that the length of the word Z is |Z|<1 + |C| + 2L<3-|C|. Indeed, the total number 
of variables in the circuit (i.e., i%l + • • • + n\ + m) is equal to the total number of wires (i.e., 
|C|); the comes from the leading letter 0; the "2L" comes from i(n), i(nx,_i), . . ., i{n\). 

Recall that y± . . . y n — Y L , and n L = n. 
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Now the permutation 7Ti = (<Ji j \z\) n+l ^ transforms the word Z into 

y l-\ *(nii_i) i(n<) . . . F 2 i(n2) F 1 i(ni) x 1 ...x m yi...y n i(n) . 

Note that the word length of m is less than (n + 2) • |Z| < 3(ra + 2) • |C| < 3 \C\ 2 over the 
alphabet {r iji+ i : < i}. 

Next (and this is a crucial idea in reversible computing), applying 
yields xi . . . x m y x . . . y n i(n) . 

Finally, applying the permutation 7r 2 = (cri, n +m+i(n)) m produces the desired final output 

0^ n ) y 1 ...y n x 1 ...x m . 
Therefore we can define w c (over the alphabet Af \ U {r iji+ i : < i}) by 

For the length we have therefore 

Since < ||Q| 2 +25|Q| (for 1 < £ < L) , and Eli l<^l = |C|,wehave Eli I^P < \C\ 2 - 

Also, i(n^) < 2, and |o"i iTt J < r^, so Y^t=iK n t) \ a i,n e \ < 2 |C|. Thus |u>c| < c • \C\ 2 , for some 
positive constant c. Also, the largest subscript in any permutation is < \Z\ < 3 |C|. Since |C| 
was squared in order to obtain strict layering, the above bounds become 

\w c \ <c\C\\ 

Jc < 3|Cf . 

The word wc can be written down in linear time, based on the words wc e (1 < I < L), and we 
saw that each wc e can be computed in polynomial time from CV 

In order to obtain a word that strongly simulates fc we need to make two additions to wc- 
A pre-processing step w is attached at the beginning (the right side) of wq, to make sure inputs 
that are "too short" are handled correctly. A post-processing step u>l+i is attached at the end 
(the left side) of wc, in order to remove excess letters introduced during pre-processing. Recall 
that we write functions to the left of the argument. The word that strongly simulates fc is 
denoted by Wc and defined by 

W c = w L+1 w c w 

We define w by 

w = r l,3(n+m)+l • • • T j,3(n+m)+j ■ ■ ■ T m,3(n+rn)+m ( < /"'0f,4) n+m ( - ) 

So, \w \ is bounded from above by a quadratic function in n + m, and J wo is linearly bounded 
in n + m. We have 

x ± . . . x m A x 1 ...x m 3 ( n+m ) ^ 1+i ^ f c (x!, . . . , x m ) x ± . . . x m 3 (™+ m ) . 

For < k < m we have on an input that is "too short" : 

xx . . . x k # h— — > zi . . . z k+3(n+m) # A 1+i(n) f c (z 1 , ...,z m ) Zi... z k+3(n+m) #, 
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where z\ . . . Zk+3( n + m ) is a permuted version of x\ . . . Xk 3 ^ n+m - ) ; this permutation depends only 
on the number k + 3(n + m). So the outcome 1+ *^ fc( z i, ■ ■ ■ , z m ) z\ . . . z k+3 ( n+m )# does not 
depend on the circuit C that was used to implement the function fc- 

Finally, it is also easy to verify that 

# A o 3 (™ +m ) # ^ o f c (o, . . . , o) o 3 ^™)- 1 # . 

We define wl+i by 

Wl+1 = (y0f,4) " m T n+m,3(n+m)+n+m ••• r j,3(n+m)+j ••• r l,3(n+m)+l (' ) • 

So, |wl+i| is bounded from above by a quadratic function in n + m, and J WL+1 is linearly 
bounded in n + m. One can verify easily that 

1+ ^ Vl . . . y n xi . . . x m 3 (" +m ) K 1+< ( n )yi . . . y n :ri . . . x m . 

For < k < m, and G {0,1}, let Zx . . . Zk+3( n +m) be the permuted version of 

x\ . . . Xk 3 ^™ +m ^ considered above. Let fc{zi ■ ■ ■ z m) — Vi ■ ■ ■ Vn\ n °t e that this string does 
not depend on the circuit C that was used to implement the function f c . 

Then the sequence of transformations wl+i will be applied to yi . . . y n z\ . . . Zk+3( n + m ) #■ 
This will produce a new string (e {0, 1}*#) which does not depend on the circuit C that was 
used to implement the function f c . 

Also, recall that on argument the outcome of the sequence of transformations wqWc is 
y%. . .y n 3 ^ n+m ^~ 1 #, where /c(0, . . . , 0) = y\ . . . y n . Then, applying wl+i yields a string 
(g {0, 1}*#) which does not depend on the circuit C that was used to implement the function 
fc- □ 

Remarks: The length of wq (over the infinite alphabet Af\ U {Tj i+1 : < z}), and the largest 
subscript J Wc (in any transposition occurring in wc) are bounded from above by polynomials in 
\C\. Hence, if we write subscripts of transpositions in unary notation, the length of wc remains 
bounded from above by a polynomial in |C|. 

The group G 3j i is finitely generated, so one may wonder what the word length of wc would 
be if Wc were expressed over such a finite generating set; we will see that it is exponential 
( Lemma 18.61 Theorem I8.7J1 . 

In the next section we reduce the above problem to a certain generalized word problem of 
G 3j i, still over the infinite generating set A 31 U {r iji+ i : < i}. 

4 Reduction to a generalized word problem 
(over an infinite generating set) 

We will now restate the above reduction as a reduction to a generalized word problem of a 
Thompson group, over an infinite generating set. In the following definitions we represent 
elements of £/ 3>1 by right ideal isomorphisms between essential right ideals of {0, 1, #}*. We will 
extend the classical concepts of stabilizers and fixators to the case of partial permutations. 

Definitions. We say that g partially stabilizes a set of words S C {0, 1, #}* iff g(S)Ug~ 1 (S) C 
S. So g maps S into itself wherever g is defined, and similarly for g~ l . For a subgroup G C £/ 31 , 
the partial stabilizer (in G) of S is 
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pStab G (S) = {geG : g(S) U g-\S) C S}. 

We say that g totally stabilizes a set of words S iff g(S) U g^i^S) C S, and in addition, 
5 1 C Dom(g) n lm(g). So g totally stabilizes S iff g partially stabilizes S and moreover, g and 
g -1 are defined everywhere on S. For a subgroup G C (/ 3)1 , the total stabilizer (in G) of S" is 

tStab G (S) = {«eG: ^Uf^CSCDom^nlmfe)}. 

We say that g partially fixes a set S iff g(x) = x for every i e 5 PI Dom(g) n Im(g); this is 
also called partial "pointwise stabilization". For G C (? 31 , the partial fixator (in G) of S is 

pFix G (S') = {g£G: (Vac G 5 n Dom(g) n Im(g)) g(z) = a;} 

i.e., the elements g of G that fix every point in 5 on which g and g -1 are defined. We can also 
define the total fixator by 

tFix G (£) = {geG: S C Dom(g) n Im(g) and (Vr G 5) flr(ar) = ar}, 

i.e., the elements g of G that fix every point in S and such that g and g -1 are defined on every 
point of S. This completes the definitions of stabilizers and fixators. 

Observe that when R C {0, 1, is a right ideal generated by a maximal prefix code P 
(over the alphabet {0, 1, #}), then 

pFix G (Pt) = tFix G (_R) = tFix G (P). 

So, for right ideals, the notions of partial fixator and total fixator coincide. Moreover, for every 
right ideal S C R such that S is essential in R (i.e., S 1 has a non-empty intersection with every 
right ideal contained in R), we have: 

pFix G (S) = pFix G (i?). 

It is easy to see that tStab G (X) and tFix G (X) are always groups (for any group G C Q 31 and 
any set X C {0, 1, #}*). However, pStab G (X) and pFix G (X) are not always groups. For this 
paper, all we need is the next Lemma. 

Lemma 4.1 Let G C (/ 31 . For any set X of words over {0, 1, tStab G (X) and tFix G (X) 
are subgroups of G. For any right ideal R of {0, pFix G (i?) is a subgroup of G. 

If G C G 31 and i/ B* is any free submonoid of {0, 1, #}* (generated as a submonoid by a 
set of words B C {0, 1, £/ien pStab G (S*) and pFix G (£?*) are subgroups of G. 

Proof. The sets tStab G (X), tFix G (X), pFix G (_R), and pStab G (5*) are closed under inverse, 
by definition. The closure under multiplication is obvious for tStab G (X) and tFix G (X). And 
when R is a right ideal we saw that pFix G (i?) = tFix G (i?) 

If x G B* and y2 2 , ifi G pStab G (5*), and if (m&x<p 2 ( Pi){x) is defined, we need to show that 
(max.ip2ipi)(x) G B*. Note that ipi(x) and ip2<fx(x) might be undefined; but in any case, there 
exists w G B* such that ^2^1 (xw) is defined; we just need to take w long enough. Then we also 
have (p2(pi(xw) G B* and (p2(fx{xw) = (max<^2<^i)(£w) = (maxi/)2^i)(i) • w. Therefore, since w 
and (max<y22<^i)(x) -w belong to B*, and since B* is free, we conclude that (max^2<^i)(i) G B* . 
The proof for pFix G (S*) is very similar. □ 

With this terminology we can restate Lemma 13.41 
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Lemma 4.2 Let f and g be any boolean functions such that f and g have the same number of 
input variables, and f and g have the same number of output variables. If f and g are simulated 
by <&f, respectively $ 5 , then the following are equivalent: 

• f = g 

• $7 1 $,epFix G§ ,od3(0{0,l ) #}*) 

In the case of strong simulation the following are equivalent: 

• f = g 

. ^^epFixo^ao^HO,!^}*) 

• G pFix^od 8 ({0,#}{0,l,#}*) 

Theorem 13.51 and Lemma 14.21 give a polynomial-time one-to-one reduction from the equiva- 
lence problem for acyclic circuits to the generalized word problem of pFix G mod3(0{0, 1, #}*) in 
G 3i i, with elements of G 3) i written over the set of generators A 31 U {t^+i : < i} (where A 31 
is a finite generating set of G 3) i). It follows that this generalized word problem is coNP-hard. 
Because of the existence of a strong simulation, we also have a polynomial-time one-to-one 
reduction from the equivalence problem for acyclic circuits (with last output variable when 
the inputs are all 0) to the generalized word problem of pFix G mod3({0, #}{0, 1, #}*) in G 31 over 
the set of generators A 3i i U {t^+i : < i}. Hence we have: 

Corollary 4.3 (co-NP hard generalized word problem). The generalized word problems 
of pFix G mod 3(0{0, 1, #}*) and of pFix G mod3({0, #}{0, 1, #}*), as subgroups of Cr 3) i (with 
generating set A 3j i U {t^+i : < i}) are coNP-hard. 

The following subgroups of G 3 i will play a major role. 

Definition 4.4 The groups of bit-preserving (or {0, 1} -preserving) transformations, G 3j i(0, 1) 
and G 3 n ° d3 (0, 1), are defined by 

G 3il (0,l) = pStab G31 ({0,l}*) 

= {0GG 3il : 0({O,1}*)C{O,1}* and ^({O, 1}*) C {0, 1}* }, 

G^ d3 (0,l) = P Stab G „ : od3({0,l}*) 

= {0GG 3il (O,l) : 

|0(x)| = \x\ mod 3, for all x G {0, 1}* for which <p(x) is defined}. 

The groups of ^-preserving transformations, G 3j i(0, 1; if) and G 3 n ° d3 (0, 1; #), are defined by 

pStab G31 ({0,l}*) n tStab G3il ({0,l}*#). 
{0GG 3 ,i(0, 1) : for all xG {0,1}*, 

<f>(x#) and _1 (x#) are defined and 0(x#), _1 (x#) G {0, 1}*#}, 

= pStab G mod 3 ({0, 1}*) n tStab G mod 3 ({0, 1}*#) 
{0GG 3il (O,l;#) : 

|0(x)| = \x\ mod 3 for all x G {0, 1}* for which 4>(x) is defined}. 



G 3) i(0,i;#) = 

G™ d3 (0,l;#) 
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It follows from Lemma IP that G 3 ,i(0,l), G 3 ,i(0, 1; #), G^ d3 (0,l), and G£? d3 (0, 1; #) are 
indeed groups. 

All the elements of G% t \ that we have used in the proof of Theorem 13.51 are generated by 
<£-,, ify, ip A , Tij (0 < i < j), and <fof,4- These elements also belong to G 3 n ° d3 (0, 1; #) (c 
G 3 n ° d3 (0, 1)). Hence, the above Corollary implies the following, where A# is a finite generating 
set of Gff 3 (0, 1; #), and A (0 ,i) is a finite generating set of G^ d3 (0, 1): 

Corollary 4.5 (co-NP hard generalized word problem). The generalized word problems 
of pFix G (0 {0, 1, #}*) and of pFix G ({0, #}{0, 1, #}*) as subgroups of G = G 3 n ? d3 (0, 1; #) are 
coNP-hard. Here the generating set used for G™ d3 (0, 1; #) zs A# U {r^+i : < «}. 

The generalized word problems of pFix G -(0 {0, 1, #}*) and of pFix G ({0, #}{0, 1, #}*), as 
subgroups of G = G 3 n ° d3 (0,l) are coNP-hard. Here the generating set used for G 3 n ° d3 (0, 1) is 
A( ,i) U {r< )i+ i : < «}■ 

We will see later that G™° d3 (0, 1; #) and G™° d3 (0, 1) are finitely presented, so the finite gener- 
ating sets A# and A( 0) i) exist. 

Here is a more concrete view of the subgroup G 3 n ° d3 (0, 1; #): 

Lemma 4.6 The group G 3 n ° d3 (0, 1; jf) consists of the elements of G^\ that have tables of the 
form 

x x ... x n x[# ... x' m # 

_yi ■■■ Vn yx# ■ ■ ■ y'm# \ ' 

for some positive integers n, m, with X\, . . ., x n , x' l! . . ., x' m , yi, . . ., y n , y[, . . ., y' m G {0, 1}*, 
and \xi\ = \ui\ mod 3 (for alii = l,...,n). Moreover, {x 1 ,...,x n } U {x[, . . . , x' n }# and 
{yi, . . . , y n } U {y[, . . . , y' n }i^ are maximal prefix codes over {0, 1, 

Proof. From the shape of the above table we see immediately that the corresponding element 
of G 3) i, as well as 0"\ map {0,1}* into {0,1}*, and {0, 1}*# into {0, 1}*#. On {0,1}*, 
preserves length modulo 3. Thus, G pStab G mod3({0, 1}*). Moreover, since {xi, . . . ,x n } 
U {x[, . . . , x' n }# is a maximal prefix code, 0(w#) is defined for all w G {0,1}*. Similarly, 
is always defined. Thus, G tStab G mod3({0, 1}*#). 

Conversely, if G Gff 3 {0, 1; #) then the domain code of is a subset of {0, 1}* U {0, 1}*# 
(since partially stabilizes {0,1}* and totally stabilizes {0, 1}*#). For the same reason, the 
image code of is a subset of {0, 1}* U {0, 1}*#. By Lemma 14.71 and the definition of 
G™° d3 (0, 1; it now follows immediately that has a table of the above form. □ 

Lemma 4.7 (1) If P C {0,1}* U {0, 1}*# is a maximal prefix code over {0, 1, #} taen 
P = Pi U P2# /or some Pi, P2 C {0, 1}*, t/ie following properties: 

• Pi is a maximal prefix code over {0, 1}; 

• P2 = {p 6 {0, 1}* : p is a strict prefix of some element of Pi}. 
When Pi is finite, this last property implies: IP2I = |Pi| — 1. 

(2) Conversely, if P = Pi U P 2 # for some Pi,P2 C {0, 1}* with the above two properties, 
then P is a maximal prefix code over {0, 1, jf\. 
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Proof. The proof is not difficult and appears in the Appendix. □ 
Similarly, Gr™° d3 (0, f ) has a concrete description. 

Lemma 4.8 The group G™ d2, {Q, 1 ) consists of the elements of G 3 ,i that have a table of the 
form 

X\ . . . X n X-^^Si . . . X m ^j z S m 

_yi ■■■ y n yi#t m ■ ■ ■ y' m #t m \ ' 

for some positive integers n,m, with x\, . . ., x n , x[, . . ., x' m , y±, . . ., y n , y[, . . ., y' m G {0, 1}*, 
and Si, . . ., s m , t\, . . ., t m G {0, f , #}*, and \xi\ = \yi\ mod 3 for all i = 1, . . . , n. Moreover, 
{x u ...,x n } U {zi#si,...,a4#s m } and {y u . . . , y n } U {y[^t 1: . . . , y' n #t m } are maximal 
prefix codes over {0, f , #}. 

Proof. The proof is similar to the proof of the corresponding Lemma for G^ d3 (0, 1; #). □ 

In the next section we will reduce the above generalized word problems to the word problem 
of G 3i i (still over the infinite generating set A 31 U {r iji+ i : < i}). 

5 Reduction to the word problem of a Thompson group 
(over an infinite generating set) 

We will give a linear-time /c-ary conjunctive reduction (for a constant k) from the generalized 
word problem of pFix G mod3({0, #}{0, 1, #}*) to the word problem of G 3i i, over the infinite 
generating set A 3jl U {r iji+ i : < i}. 

Definition 5.1 A polynomial-time k-ary conjunctive reduction from a language L C A* 
to a language W C B* is a function f : A* — > (B*) k such that f(x) is computable in time 
bounded by a polynomial in \x\, and such that we have: 

x e L iff f(x) = (y 1 , ...,yk) with y { G W for alii (1 < % < k). 

Any polynomial-time k-ary conjunctive reduction, for some constant k, is called polynomial-time 
constant- arity conjunctive reduction. 

The conjunctive reductions used in this paper will have a constant arity. More general defini- 
tions of polynomial-time conjunctive reductions are possible (where the arity k is a polynomial 
function of \x\), but we will not need this here. Conjunctive reductions are a special case of 
truth-table reductions. Note that the classes P, NP, and coNP are closed under polynomial-time 
constant-arity conjunctive reduction. 

In the classical theory of permutation groups there are many results of the following form: 
Let G be a permutation group acting on a set X (i.e., G C &x), and let Qi, Q2 be two 
"complementary" subsets of X. Then for all g G G we have: 

(C) g G Fix G (Qx) iff gh = hg for all h G Fix G (Q 2 ). 

We call property (C) the commutation test for the generalized word problem of Fix G (Qi). 
The left-to-right implication is obvious. For the right-to-left implication to be true, special 
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assumptions have to be made on G, on its action (i.e., on the embedding G <^-> &x), and on 
the meaning of "complementary" . 

What is interesting about the commutation test (C) is that it reduces the generalized word 
problem of FixGt(Qi) (as a subgroup of G) to N instances of the word problem of G, where N 
is the minimum number of generators of Fixg(Q2)] indeed, g commutes with all elements h in 
FixG^Qx) iff g commutes with all the members of a generating set of Fix G (Qx). So, if Fix G (Q 2 ) 
is finitely generated then we obtain a constant-arity conjunctive reduction of the generalized 
word problem of Fixg{Qi) to the word problem of G. 

In this Section we prove our version of the commutation test, namely Theorem 15.51 below. 
Since we deal with partial actions (Thompson groups), everything is somewhat different from 
the classical case. We first introduce some concepts about prefix codes and fixators. 

We make the following convention: Let : A* — > A* be a partial map and x £ A*; when we 
write 0(x) it is to be understood that 0(x) is defined (i.e., x G Dom(0)). 

Definition 5.2 Let A be a finite alphabet. Two prefix codes P,P' C A* are complementary 
prefix codes iff P U P' is a maximal prefix code over A, and PA* D P'A* = 0. 

Definition 5.3 Let A be a finite alphabet with \A\ = n, and let G C Q n l (i.e., G is a subgroup 
with a particular embedding). The fixator pFix G (P' A*) is called maximal iff there exists P C A* 
such that P, P' are complementary prefix codes, and such that we have: 

for all x G PA* there is h G pFix G (P'yl*) such that h(x) ^ x. 

Equivalently: The fixator of a right ideal P'A* is maximal iff it does not fix any larger right 
ideal than P'A*. 

Recall our convention that when we write 4>(x) (for a partial map 0) it is to be understood that 
4>(x) is defined (i.e., x G Dom(0)). 

In analogy with G^ d3 (0, 1) and Gff 3 (0, 1; #) we use the notation 
^7 d3 (0,l) = pStab e »o d 3({0,l}*), 

S 3 7 d3 (0,l;#) = P Stab^od 3 ({0,l}*) n tStab 03T i3({O,l}*#). 

Definition 5.4 Let G C ^3 1 ° d3 (0, 1; #) be a group. Let P, P' be complementary prefix codes 
over {0, 1, #} ; with P n {0, 1}* ^ 0, P' n {0, 1}* ^ 0, and P, P' C {0, 1}* U {0, So, 
P = Pi U P 2 #, and P' = P[ U P 2 '#, according to Lemma\Jl[ 
The fixator pFix G (P'{0, 1, #}*) is separating on P{0, 1, iff the following hold: 

• For any ordered pair of prefix-incomparable words (x,y) with x, y G Pi{0, 1}* ; there exists 
h G pFix G (P / {0, 1, #}*) and there exists u G {0, 1}* such that 

h(xu) = xu and h(yu) ^ yu. 

• For any ordered pair of prefix-incomparable words (x,y) with x, y G Pi{0, 1}*# U P2# 
there exists h G pFix G (P'{0, 1, #}*) such that 

h(x) = x and h{y) ^ y. 

We will not need any explicit separation requirements in the case where x G {0, 1}* and y ^ 
{0, 1}*, or the case where x {0, 1}* and y G {0, 1}*. Also, note that for words x, y G {0, 1}*#, 
x, y are prefix-incomparable iff x ^ y. 
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Theorem 5.5 (Commutation test for pFix G (0 {0,1, #}*) ). Let G = Gff 3 (0 } 1; #). 
Then for any g G G we have: 

<?GpFix G (0{0,l,#}*) iff gh = hg for all h G pFix G ({l, #}{0, 1, #}*). 

This Theorem follows immediately from the following two Propositions, 15.61 and 15.71 

Proposition 5.6 Suppose pFix G (P'{0, 1, #}*) is separating on P{0, 1, jf\* , where G, P , and 
P' are as in Definition \5.4\ Then for all g G G we have: 

If g commutes with all elements of pFix G (P'{0, 1, #}*) then g G pFix G (P{0, 1, #}*). 

Proposition 5.7 Let P, and P' be as in Definition \54\ and let G = G™° d3 (0, 1; #). Then 
the fixator pFix G (P'{0, 1, #}*) is separating on P{0, 1, #}*. 

Before proving Propositions 15.61 and 15.71 we need some lemmas. 

Lemma 5.8 Let P C A* be any prefix code, where \A\ — n > 2. Assume if G pStabg n (PA*), 
but (f G" pFixg n (PA*). Then there exists x G PA* such that x and <p(x) are not prefix- 
comparable. 

In particular, if <p G Q n ,i is not the identity element then there exists x G domC(^) such 
that x and <p(x) are not prefix- comparable. 

Proof. The proof is in the Appendix dedicated to properties of prefix codes. □ 

Lemma 5.9 Suppose P, P' C A* are complementary finite prefix codes. Let x\, . . . , Xk G PA* 
(for any positive integer k), and assume x\, . . ., Xk are two-by-two prefix-incomparable. Then 
for all n of the form n — 1 +i (\A\ — 1), with n> \P\ — k + (\A\ — 1) (\x\\ + . . . + \xk\), there 
exists a prefix code Q such that 

• Q U {xi, . . . , Xk} and P' are complementary prefix codes, with Q U {xi, . . . , x^} C PA* ; 

• \Q\ = n. 

• The set of prefixes of P is a subset of the set of prefixes of Q U {xi, . . . ,XjA-. 

Proof. The proof is in the Appendix dedicated to properties of prefix codes. □ 

Lemma 5.10 Let G, P, and P' be as in Definition \5.4\ If pFix G (P'{0, 1, #}*) is separating 
on P{0, 1, #}* then it is a maximal fixator. 

Proof. Suppose by contradiction that there exists Xq G P{0, 1, #}* such that h(xo) = Xq for 
all h G pFix G (P'{0, 1, #}*). The prefix code P is of the form P x U P 2 #, with P u P 2 C {0, 1}*, 
by Lemma f4. 71 

Case I: x G Pi{0, 1}*. 

Choose x = xqO and y = XqI. Then x and y are prefix incomparable, hence by the separation 
property of the fixator, there exists h G pFix G (P'{0, 1, #}*) and u G {0, 1}* with 

h (xu ) = xu , h (yu ) ^ yu . 

However, ho(yuo) ^ yuo contradicts the fact that ho(xo) = xq. 

Case 2: x G Pi{0, 1}*#, or x G P 2 # with |P 2 | > 2. 
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Let xq = t>o#- Let wq G P2 with wq 7^ vq, and choose x = wo# and y = t>o#- Then x and 
1/ are prefix incomparable, and both are in {0, 1}*#; so there exists h G pFix G (P'{0, 1, #}*) 
with 

/i (x) = x, h (y) ^ y. 
However, h (y) 7^ y contradicts the fact that h (x ) = x o- 

Case 3: xo G p2# and | -F2 1 = 1- (Obviously the case | -F2 1 = cannot occur when x G p2#-) 
Then P 2 = {^o}? so we have x = t> #. Let z G Pi (recall that in the Definition 15.41 we 
assume that Pi 7^ 0). Let x = z # and y = x = v #. Since z 7^ v , x and y are prefix 
incomparable, and both are in {0, 1}*#; so there exists h G pFix G (P'{0, 1, #}*) with 

/i (x) = x, /i (y) 7^ y. 

Again, h (y) 7^ ?/ contradicts the fact that h (x ) — ^o- 1=1 

Proof of Proposition 15. 61 Let g G G and assume g commutes with all elements of 
pFix G (P'{0, 1, #}*). We want to show that g G pFix G (P{0, 1, #}*). We first prove: 

Claim: g stabilizes P'{0, 1, #}* and P{0, 1, #}*. 

Proof of the Claim: Assume by contradiction that g(x') = y for some x' G P'{0, 1, #}* 
and y G P{0, 1, #}*. Since g commutes with all elements of the fixator we have for all h G 
pFix G (P'{0, 1, #}*): gh(x') = hg(x') = g(x') = y, i.e., h{y) = y. This contradicts the 
maximality of the fixator pFix G (P'{0, 1, #}*), proved in Lemma [5.101 So g maps P'{0, 1, #}* 
into itself. 

In a similar way one proves that g~ x maps P'{0, 1, #}* into itself. It follows from this that 
g also maps P{0, 1, #}* into itself. Indeed, if we had g(x) = y' for some x G P{0, 1, #}* and 
y' G P'{0, 1, #}* then g~ x {y') = x, contradicting the fact that g~ l maps P'{0, 1, #}* into itself. 

Similarly, g~ x maps P{0, 1, #}* into itself. This proves the Claim. 

Assume now by contradiction that g does not fix some element X\ G P{0, 1, #}*: g(xi) = yi 7^ 
xi. By the Claim, ^ G P{0, 1, #}*. 

By Lemma f5.8l there exist x, y G P{0, 1, $=}* such that x and ?/ are prefix incomparable and 
g(x) = y. And since g commutes with the fixator, we have for all h G pFix G (P'{0, 1, #}*): 
gh(x) = hg(x) = h(y). 

On the other hand, the separation property of the fixator implies that there exists h G 
pFix G (P'{0, 1, #}*) and u Q G {0, 1, #}* (with u empty if x, y G {0, 1}*#), such that h (yu ) 7^ 
yuo and h (xuo) = xuq. 

The equality g/i(x) = h(y) implies gho(xuo) = h(yuo); this, together with ho(xuo) = xuo, 
implies yuo = gho(xuo) = h(yu ). But this contradicts ho(yuo) 7^ yuo- □ 

Lemma 5.11 (1) For all x,y G {0,1}* i/iere exzsi letters £±,£2 G {0,1} stzc/i i/iai x£i, and 
2/^2 ore prefix incomparable. 

(2) For a// x,y,z G {0,1}* i/iere exzsi letters £\,...,£q G {0,1} sitc/i i/iat x£i£3 ; yl^l^, and 
z£^£§, are prefix incomparable. 

Proof. The proof is in the Appendix dedicated to properties of prefix codes. □ 

Notation: When S C A*, 

> P re{(S) = {p G A* :p > pref s, for some s G S}, 



29 



i-e., > P rcf(5') is the set of all prefixes of words of S. 

>pref(S) = {p e A* :p > prcf s, for some s G S}, 
i.e., > pre f(5') is the set of all strict prefixes of words of S. 

Proof of Proposition 15771 Let x, y G Pi{0, 1}* and assume x and y are prefix incomparable. 
We want to find h G pFix G (P'{0, 1, #}*) and «o G {0,1}* such that ho(xuo) = xuo and 
h (yu ) ^ yu . If x,y G {0, 1}*# then u is empty. 

Case 1: x,y e Pi{0,l}*. 

The words x,y0,yl are prefix-incomparable two-by-two (for x and yO, use Lemma ESI and 
similarly for x and yl). Now use Lemma to construct a maximal prefix code QU{x, yO, yl}U 
P', with Q C P{0,1,#}*. 

Define h E G = G^ d3 (0, 1; #) by 

h (y0) = yl, h (yl) = yO, h (x) = x, and h is the identity on Q U P'. 

So, QU{x, |/0, yljUP' is the domain code and image code of h . Note that h preserves lengths. 
Then h G pFix G (P'{0, 1, #}*), h (y0) ^ yO, and /i (x0) = xO (since h (x) = x). So here, 
plays the role of uq in the separation property. 

Case 2: x,y e Pi{0, 1}*# u P 2 #. 

Let x = x # and ?/ = y # 
Case 2.1: y g Pi{0, 1}*. 

Either Xq is different from both y and ?/oO, or x is different from both y and j/qI- We only 
consider the case where Xq is different from both y Q and yoO; the other case is similar. 

• Assume xq G P2. 

By Lemma 15.91 over the alphabet A = {0, 1}, there is a finite prefix code Q\ C Pi{0, 1}* 
such that Qi U {2/0OO} and P[ and complementary prefix codes (over {0,1}). Therefore the 
following set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code over {0, 1, #}: 

C = Qi U {y 00} U P/ U > prcf (Qi U {y 00} U P[) #, 
Now we define h , with domain code and image code C, by 

^o(?/o#) = 2/oO#, h (y 0#) = yo#, and /io is the identity everywhere else on C. 
Thus, h (y) ^ y. Moreover, h G pFix G (P'{0, 1, #}*) because y ,y <£ P 2 '; indeed, y ,y 
GP{0,1}* CP{0,1,#}*. 

And Hq preserves the length of strings in {0, 1}* (since ho is the identity on {0, 1}* wherever 
ho is defined). 

We also claim that h (x) = x. Indeed, x belongs to P 2 , which is contained in > pref (P 1 UP/); 
moreover, > pre f(Pi) C > pr ef(Qi)5 by the 3rd point of Lemma l5~TH Therefore, x # belongs to 
C. On the other hand, Xq is different from y and yoO. 

• Assume x G Pi{0, 1}*. 

Then, by Lemma 15. Ill there are £1,^2 G {0, 1} such that x £i and y £ 2 are prefix incompa- 
rable; also, xo£\, y§ti G Pi{0, 1}*. By applying Lemma 1531 over the alphabet A = {0,1} we 
obtain a finite prefix code Q\ C Pi{0, 1}* such that Qi U {xo^i, yo^O} and P{ and complemen- 
tary prefix codes (over {0, 1}). Therefore the following set C C {0, 1}* U {0, 1}*# will be a 
finite maximal prefix code over {0, 1, #}: 
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C = Q 1 U {x £i, y&O} U P{ U > prcf (Qi U {x £i, yohO} U P{) #. 
Now we define /iq, with domain code and image code C, by 

ha{vx>if) = yoh#, h Q (y £2#) = 2/o#, an d h is the identity everywhere else on C . 
Thus, /i (y) ^ y. Moreover, h E pFix G (P'{0, 1, #}*), because y , y £ 2 <£ P' 2 \ indeed, y , y £ 2 

ePi{o,i}* cP{o,i,#}*. 

And /iq preserves the length of strings in {0, 1}* (since ho is the identity on {0, 1}* wherever 
it is defined). Also, h (x) = x, since xo# belongs to C (since xq is a strict prefix of x £i), and 
since Xo is different from y and 2/0^2 • 

Case 2.2: y e P2. 

Since Pi 7^ 0, there exists w G Pi; hence is different from U7 , woO, and woOO. Also, 
Xo is different from wqO or from w 00 (or from both). Let ZoO be one of WoO or w 00, so that 
z 7^ x . We still have z 7^ y and z G Pi{0, 1}*. 

• Assume xo G P2. 

By Lemma IHUl over the alphabet A = {0, 1}, there is a finite prefix code Q\ C Pi{0, 1}* such 
that Q\ U {^oO} and P{ and complementary prefix codes (over {0, 1}). Therefore the following 
set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code over {0, 1, #}: 

C = Qi U {2 0} U P[ u > prcf (Qi U {z 0} U P[) #. 
Now we define h , with domain code and image code C, by 

/i (z/o#) — z ojf, ho(z #) = yo# 5 an d h is the identity everywhere else on C. 
Thus, h (y) 7^ y and /io(aO = Note that /io(^o#) and hoiyoif) are defined since xo,yo E P 2 C 
> P ref(Pi U Pi); moreover, > pre f(Pi) C > pre f(<3i), by the 3rd point of Lemma E33 Therefore, 
and ?/o# belong to C. 

Also, h G pFix G (P'{0, 1, #}*), because y ,z P' 2 ] indeed, y , z G P 2 U Pi{0, 1}* C 
P{0,1,#}*. 

Also, ho preserves the length of strings in {0, 1}* since ho is the identity on {0, 1}* wherever 
it is defined. 

• Assume x G Pi{0, 1}*. 

Then, by Lemma 15.111 there are £i,£ 2 G {0,1} such that Xq£i and z £ 2 are prefix incom- 
parable. By applying Lemma 15.91 over the alphabet A = {0, 1} we obtain a finite prefix code 
Qi C Pi{0, 1}* such that QiU{xo£i, zo£ 2 } and P[ and complementary prefix codes (over {0, 1}). 
Therefore the following set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code over 
{0,1, #}: 

C = Q x U {x £x, zo£ 2 } U P{ U > prcf (Qi U {x £i, z £ 2 } U P[) #. 
Now we define h , with domain code and image code C, by 

hoiyoU 1 ) = ^o#, ho(zoif) = yo#, and ho is the identity everywhere else on C. 
Thus, h (y) 7^ y, and y E C (since y E P 2 C > pre f(Pi U P{) C > prcf (Qi U P[) ). 

Moreover, fo G pFix G (P'{0, 1, #}*), because y , ^0 ^ Pj> indeed, y ,z E P 2 U Pi{0, 1}* 
CP{0,1,#}*. 

And /io preserves the length of strings in {0, 1}* (since h is the identity on {0, 1}* wherever 
it is defined). Also, h (x) = x, since x # belongs to C (since x is a strict prefix of x £i), and 
since xq is different from yo and zq- □ 
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As we observed near the beginning of this Section, the circuit equivalence problem reduces 
to the generalized word problem of Fix G (0 {0, 1, if}*), as subgroup of G = G™° d3 (0, 1; if). The 
generating set used for G is A 0) i ; # U {t^+i : < i}, where A 0) i ; # is a fixed finite generating set 
of G™° d3 (0, 1; if). We will prove in the next Section, and independently of this Section, that 
G™° d3 (0, 1; #) is finitely presented. 

Theorem 15 . 51 reduces the circuit equivalence problem to the word problem of G™° d3 (0, 1; if). 
The reduction is an unbounded conjunctive reduction, namely, the conjunction of all word 
problems u gh = hg" , as h ranges over Fix G ({l, #}{0, 1, #}*), where G = G^ d3 (0, 1; #). 

However, Proposition 15. 13l below implies that pFix G ({l, #}{0, 1, #}*) is isomorphic to G = 
G™° d3 (0, 1; if). This and the fact that G is finitely generated (proved in Proposition l6.4|) implies 
that only the finitely many generators of pFix G ({l, #}{0, 1, #}*) need to be used in the role of 
"/i" in the Commutation Test. This then yields: 

Corollary 5.12 The circuit equivalence problem reduces to the word problem o/G™° d3 (0, 1; if), 
and hence to the word problem of G^i (over an infinite generating set), by a polynomial-time k- 
bounded conjunctive reduction. Here, k is the minimum number of generators o/G™° d3 (0, 1; if). 

Proposition 5.13 For G = G™° d3 (0, 1; if), the subgroup pFix G ({l, #}{0, 1, #}*) is isomor- 
phic to G. 

Proof. An element <p e G = G^ d3 (0, 1; #) belongs to Fix G ({l, #}{0, 1, #}*) iff cp has a table 
of the form 

T 1 # Oxx ... 0x n . . . 0x^# " 

^ L 1 # Oyi ■ ■ ■ OVn 0y[# . . . 0y' m # 

where Xi, yi, x'j, y'j range over {0, 1}*, and \x{\ = \yi\ mod 3 (for i — 1, . . . , n). The isomorphism 
to G^ d3 (0, 1; #) simply maps this table to 

; X\ . . . X n Xyf ••• ^m^f 

L 2/1 • • • yn y[# ■■■ y'm# _ 

It is straightforward to see that ip preserves lengths mod 3 on {0, 1}* if <p does, and that (p i— > i/j 
is an isomorphism. □ 

The commutation test not only works for certain fixators in G™° d3 (0, 1; if), but also for the 
analogous fixators in G^i, G™° d3 , and G™° d3 (0, 1). This is proved in the Appendices A2 and 
A3. 

Our next task will be to reduce this non-standard word problem of G^i (over an infinite 
generating set) to the word problem of a finitely generated group; we will actually obtain a 
finitely presented group. 



32 



6 Finite presentations 



We will now prove that the groups £73,1(0, 1; #) and G 3 n ° d3 (0, 1; #) are finitely generated, and 
in fact finitely presented. Higman's technique (see pp. 24-33 of [Hj) can be applied rather 
directly to these groups, once we have proved certain properties of prefix codes. We will use 
Higman's notation 



Xi . 


. X 


yi ■ 


■ y 


Zl . 


. z. 



for a composite of the form 

2/1 • • • y n 

Z\ ... z n 

where {x±, . . . , x n }, {yi, . . . , y n }, {z±, . . . , z n } are three maximal prefix codes of cardinality n. 
A remark on terminology: Higman uses the word "depth" of a prefix code to refer to the 
number of vertices of the inner tree (he has a different point of view, and does not talk about 
prefix codes or trees explicitly). We will not follow Higman's terminology and use the word 
depth for the actual depth of a tree, i.e., the number of edges in a longest path from the root 
to a leaf. 

We first give a lemma concerning the particular maximal prefix codes used in £3,1(0, 1; jf). 
Recall that, for an alphabet A, the tree of the free monoid A* consists of the vertex set A* and 
the edge set {(w, wa) : w G A*, a G ^4}; the tree is rooted, with the empty word e as the root. 
For a prefix code P C A*, the prefix tree of P consists of the vertex set 

{w G A* : w is a prefix of some element of P}, 

with root e. The edge set is 

{(w, wa) : a G A, wa is a prefix of some element of P}. 

So the elements of P are the leaves of the prefix tree of P. The inner (or internal) vertices 
of a rooted tree are, by definition, the vertices that are not leaves (i.e., a vertex v is internal 
iff there exists an edge (v, w) in the tree, for some vertex w). The tree spanned by the inner 
vertices is called the inner tree. We will denote the inner tree of the prefix tree of a prefix code 
PbyT in (P). 

Lemma 6.1 (0) Every finite maximal prefix code P over an alphabet A (e.g., A = {0, 1, ) 
has cardinality \P\ = 1 + (\A\ — l)ip, where ip is the number of inner vertices of the prefix 
tree of P. 

If \P\ > 1 then P contains a subset of the form uA (for some word u G A*). 
Also, for every integer i > 0, there exists a maximal prefix code P over an alphabet A of 
cardinality 1 + (\A\ — 1) i. 

(1) If P C {0, 1}* {e, jf\, and \P\ > 1, then P contains a subset of the form 

u {0, 1, for some u G {0, 1}* 

(2) For every integer i > 3 there is a maximal prefix code P C {0, 1}* {e, with \P\ = l+2i, 
and with the following property: 



X\ . . . x n 

2/1 • • • y n 
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P contains a subset of the form {u, v} {0, 1, #} ; for some u, v G {0, 1}* ; u ^ v. 

(3) For every integer i > 5, there is a maximal prefix code P C {0, 1}* {e, #} ; with \P\ = 
1 + 2i, and with the following property: 

P contains a subset of the form {u, v, w} {0, 1, for some u,v,w G {0, 1}*, 
with u,v,w distinct two-by-two. 

Proof. The proof is in the appendix dedicated to properties of prefix codes. □ 

For elements of G3.1 that preserve length modulo 3, the following concept and lemma are 
important. 

Definition 6.2 Let P C A* be finite set, where A is any finite alphabet. The mod 3 cardinality 

of P is the triple (n , ni, t^) G N 3 , such that (for 2 = 0,1,2 ): 

Hi = \P H {w G A* : \w\ = i mod 3}|. 

Note that if (no, ni,n2) is the mod 3 cardinality of P then no + n\ + n<i = \P\. 

Observation: By Lemma lfo.ll (1). if a prefix code Q C A* has cardinality 2 or more, its inner 
tree T- m (Q) has at least one leaf. Moreover, if \Q\ is large enough then either T iQ (Q) has a 
second leaf, or it has two (or more) one-child vertices, both having equivalent depths modulo 
3; for this to hold, it suffices that T iQ (Q) has depth > 4. More generally, if \Q\ is large enough 
then T in (Q) has one of the following: 

(1) either T m {Q) has three leaves (or more); 

(2) or it has two leaves and two (or more) one-child vertices, both having equivalent depths 
modulo 3; 

(3) or it has one leaf, and two (or more) one-child vertices, both having equivalent depths modulo 
3, and two additional one-child vertices (or more), both having equivalent depths modulo 3. 
(For one of these three properties to be true it suffices that T in (Q) has depth > 6.) 

Lemma 6.3 . 

• Suppose that there exists a maximal prefix code Q over the alphabet {0, 1}, whose inner tree 
T- m {Q) has two one-child vertices at depths = i mod 3 (for some i G {0,1,2}/ Then there 
exists a maximal prefix code P C {0, 1}* with the same mod 3 cardinality as Q, and with the 
following property: 

there is a word u G {0, 1}* such that u ■ {0, 1} C P and \u\ = i mod 3. 

Equivalently, the inner tree of the prefix code P has a leaf at depth = % mod 3. 

• More generally, let k > 2, let ii, . . . ,ik G {0, 1, 2}, and suppose that T in (Q) has the following 
property: For every X (1 < A < k), T iQ (Q) has a leaf of depth = i\ or it has two one-child 
vertices at depths = i\ mod 3. 

Then there exists a maximal prefix code P C {0, 1}* with the same mod 3 cardinality as Q, 
and with the following property: 

there are k different words u\, . . . ,Uk G {0, 1}* such that {u±, . . . , Uk} ■ {0, 1} CP and 

\ui\ = ii, ... , \uk\ = if., mod 3. 

Equivalently, the inner tree of the prefix code P has at least k leaves that have depths respectively 
= i\, . . ., = ik mod 3. 
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Proof. The proof is in the appendix dedicated to properties of prefix codes. □ 



Lemma 6.4 (1) The group £3,1(0, 1; f) is generated by elements of table-size < 7. 
V £™ d3 (0, 1; #) is 'gem 
Hence these groups are finitely generated 



(2) The group G 3 n ° d3 (0, 1; f) is generated by elements of table-size < 61. 



Proof. (1) Higman's proof that Gn, t is finitely generated can be applied directly (see |T4*j . 
Lemma 4.2, pp. 26-27). By Lemma WTT\ (1), every element <p G £3,1(0, 1; #) of table-size 
\\(p\\ — n > 1 (in particular when n > 7) has a table of the form 

xO xl xf X4 ... x n 
Vl V2 V3 Va ■■■ Vn 

where x G {0, 1}*; a; is a leaf of the inner tree of the domain code domC(<^). The image code 
{yi, . . . , y n } also contains 3 words of the form y ix = yO, y, i2 = yl, y is = yf, where y G {0, 1}*. 
The three indices ix,i2, is are in {1, ... , n}, but any order relation between ii, i 2 , 13 is possible. 
For the relation between {1,2,3} and {ii,i2,i>z\ we have two cases, just as in [T4"] . 

Case 1 — The column index sets {1,2,3} and {^1,^2,^3} are disjoint: 

By permuting columns (if necessary) we can make (ii, 12, ^3) = (4, 5, 6); then the table of <p has 
the form 

XO Xl xf £4 X 5 Xq X7 ... x r , 

yi y% V3 yO yl y# y 7 ... y n 

If n > 7, we can apply Lemma f6.1f 2) to obtain a maximal prefix code Pi over {0, 1} with two 
leaves and with the same cardinality as domC(v?)n{0, 1}*. Then (by Lemma l4~7j) . Pi determines 
a maximal prefix code P C {0, 1}* U {0, 1}*# with the same cardinality as domC(y9). If we 
appropriately insert the code P as a row we get 



xO xl X# £4 £5 Xq X7 

uO ul m# vO vl t># z 7 

yi y% 1/3 yO yi y# y-i 



Xn 
Z n 

Un 



Thus, we can write the original element if G (^3,1 (0,1; #) as a composite of two elements of 
G^i(0, 1; jf). Each of these two factors of ip contains 3 columns in "reducible" form: Each of 
these two factors can be extended to a table of size < n — 2, obtained by replacing the three 



columns 



xO 
uO 



xl 
ul 



xH 

u4 



by the column 



x 
u 



and similarly for 



y 



Let us check that these two factors of ip belong to £3,1(0, 1; if) (and not just to £3,1). First, 
the inserted row corresponds to a maximal prefix code in {0, 1}* {e, if}. Since ip G £3,1(0, 1; 
the table of tp has the following property: Words in {0, 1}* line up (column- wise) with words in 
{0, 1}*, and words in {0, 1}*# line up (column-wise) with words in in {0, 1}*#. The inserted 
row has the same size as the table of ip, and for maximal prefix codes in {0, 1}* {e, if} we know 
that the cardinality of the code determines the number of words in {0, 1}* (or in {0, 1}*#); see 
Lemma f4. 71 Thus we can correctly line up the elements of the new row with the two rows of 
(p, in such a way that the two factors belong to £3,1(0, 1; if). 

CASE 2 — The column index sets {1,2,3} and {11,12,13} have a non-empty intersection: 
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Then, we use Lemma 16.11 (2) to create a code, and insert it into the table of <p as two rows, 
exactly as on p. 27 of 

xO xl x# Xi 

uO ul u# 

. . . UCL\ . . . ua 2 . . ■ Uds 

_ ... yai ... ya 2 ... ya 3 

where (01,02,03) is a permutation of (0,1, ua\ is in any column from 1 through 3 (not 
necessarily in column 2, as drawn on the picture); ua 2 is in any column from the one just right 
of the column of uai through n — 4, and ua 3 is in any column to the right of column ua 2 through 
column n — 3. This case is possible whenever n is large enough so that there are 3 copies of the 
triple (0, 1, if) with one overlap: n = l + 2i>3-|A| — 1=3-3 — 1 = 8, i.e., n > 9 (where 
A = {0, 1, #}). This will lead to a factorization of <p G G^,i (0, 1; #) as a composition of three 
elements, each of which can be extended to a table of size < n — 2. 

As in case 1, the two new rows can be inserted so that columns are be lined up in such a 
way that the three factors belong to (^3,1(0, 1; #) (and not just to G^i). 

The Lemma now follows by induction on the table-size. Elements of table-size < 9 are then 
used as generators. Since over an alphabet of size 3, maximal prefix codes have size 1 + 2i, it 
follows that the generators of table-size < 9 actually have table-size < 7. 

(2) The proof that G^ d3 (0, 1; #) is finitely generated follows the same outline as the proof 
for £3,1(0, 1; The only difference is that now we have to check that the factors are in 
G^ d3 (0, 1; #), not just in G 3 ,i(0, 1; #). Let <p E G^ d3 (0, 1; #). 

CASE 1 — The column index sets {1,2,3} and {^1,^2,23} are disjoint: 

Again, ip has the form 

xO xl x# 

. yi V2 V3 yO 2/1 y# vi ■■■ y n _ 

where x, y, X4, X5, yi, y 2 G {0, 1}*, \x\ + 1 = \y±\ = \y 2 \ = j mod 3, and \y\ + 1 = \x^\ = \x$\ = i 
mod 3. 

Let Q = domC(<^) and let Q\ = domC(<^) fl {0, 1}*. Then x labels a leaf of T in (Qi). 
Moreover, £4,2:5 are either the children of a leaf of T in (Qi) or they are the children of two 
one-child vertices, both having equivalent depths modulo 3. So we can apply Lemma 16.31 in 
order to obtain a maximal prefix code Pi C {0, 1}* with the same mod 3 cardinality as Qi, 
such that Tin(Pi) has a leaf at depth = j — 1 and a leaf at depth = i — 1 mod 3. So, P\ has 
the form Pi = {u0, ul, vO, vl, . . .}, with \u\ + 1 = i and \v \ + 1 = j mod 3. By Lemma \4.7\ this 
uniquely determines a maximal prefix code P = P± U P 2 j^ over {0, 1, (where P2 C {0, 1}* 
consists of the strict prefixes of elements of Pi). 

Now, as in proof (1) for G 3i i(0, 1; #), we insert the code P as a row into the table of <p. We 
line up the colums as in case 1 of (1). The columns of the table can be lined up so that the 
factors of ip are in G™° d3 (0, 1; #); indeed, Pi, domC(y9), and imC(</?) have the same mod 3 
cardinality, and \u\ + 1 = \x\ + 1 = \yi\ = \y 2 \, \v\ + 1 = \y\ + 1 = \x±\ = |x 5 | mod 3. 

Case 2 — The column index sets {1,2,3} and {h, 12,13} have a non-empty intersection: 



•En— 3 -En— 2 X n — \ X n 

vO vl v# 

vO vl v# 

■ ■ ■ Un-2 Vn-1 Vn 
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Again, let Q = domC(ip), Qi = domC(<^) n {0, 1}*; also, x labels a leaf of T in (Qi). We 
assume that n is large enough in order to make sure that T in (Qi) has either another leaf or 
two one-child vertices, both having equivalent depths modulo 3. If the depth of T in (Qi) is 
at least 4 then this will be the case, and we can apply Lemma l(i.3l In order to make sure 
that T in (Qi) has depth > 4 we assume that \Qi\ > 2 5 , and this is equivalent to assuming 
n = |domC(»| = |Qi U Q 2 #| = 2 |Qi| - 1 > 2 2 5 - 1. (Recall the Q 2 C {0, 1}* consists of all 
strict prefixes of elements of Qi, hence \Q 2 \ = \Qi\ — 1.) Thus, we assume n > 2 6 — 1 = 63. 

Now we insert the new code P twice into the table, in the same way as in case 2 of (1). 
Elements of odd table size < 63 can thus be used as generators. □ 

Next, we want to prove that G 3) i(0, 1; #) and G^ d3 (0, 1; #) are finitely presented. Following 
Higman [T3] (p. 25), we associate a table with a relation in G^^O, 1; #) or G™° d3 (0, 1; #). Let 
us fix a finite generating set for 6*3,1(0, 1; #), and let ip%, . . . ,ip n be a sequence of generators. 
By restriction of the generators, we can choose a table for each generator in such a way that the 
image code of (pi is equal to the domain code of Pi+\ (1 < i < n). Then all these domain and 
image codes have the same cardinality, say m. Putting these n tables together in an (n + l)xm 
table yields the table of the sequence (pi . . . (p n : 



Xl,l ■ 








x n+l,l ■ 


•£71+1, 771 



where the following is a table for ifi (1 < i < n): 

■ ■ ■ -^1,171 
• • • Xi-\-X,m 

Note that ip\ . . . <p n is a relator of 63,1(0, 1; #) iff x%j = x n+ xj for all j = 1, . . . , m (i.e., the 
first and the last rows are equal). 

The smallest m for which a sequence (or, in particular, a relator) (pi, . . . , (p n has a table, 
is called the table-size of the sequence (or the relator). 

The concepts of "table of a relator" , and "table-size of a relator" make sense for any sequence 
(pi,...,ip n of elements of 63,1, or in particular of G 3 n ° d3 (0, 1; 

Thanks to this concept we can formulate the previous Lemma .41 in a slightly stronger way 
(similar to Higman's Lemma 4.3). 

Lemma 6.5 Every element (p G 63,1(0, 1; jf) of table-size > 7 can be represented by a 
word over the set of elements of table-size < 7, such that the sequence w v has table-size 

<IMI- 

Similarly, every element p £ G 3 n ^ d3 (0, 1; #) of table-size \\<p\\ > 61 can be represented by a 
word Wtp over the set of elements of table-size < 61, and such that the sequence has table-size 

<IMI- 

Proof. This follows from the proof of Lemma [6.41 In that proof, we started out with a table 
of <p (of table-size \\<p\\), and repeatedly inserted rows. No columns are ever added, hence the 
table-size doesn't increase. See also the proof of Higman's Lemma 4.3 in □ 



37 



Proposition 6.6 The group G^,i(0,l;if) is presented by relators of table-size < 9, in terms 
of generators of table-size < 7. 

The group G™° d3 (0, 1; if) is presented by relators of table-size < 125 in terms of generators 
of table-size < 61. Hence G3,i(0, 1; if) and G™° d3 (0, 1; jf) are finitely presented. 

Proof. Higman's method for proving that Gjv> is finitely presented can be applied directly 
(see [H], pp. 29-33). Now we use part (3) of Lemma I6~T1 for G 3 i(0, 1; #), and Lemma lfi~3l for 
G^ d3 (0,l;#). 

For the same reason as in Lemma 16.41 the new rows that are inserted can be lined up 
(column-wise), in such a way that all pairs of adjacent rows represent elements of (^3^(0, 1; if) 
or G™f 3 (0, 1; #) (and not just of G 3A ). 

The number 9 for £73,1(0, 1; if) comes from the fact that, in order to do the row insertions 
the table-size n = l + 2i has to be at least 4x|A| - 2 = 4x3-2 = 10 (where A = {0, 1, #}). 
Hence i > 5, hence n > 11. So, for the generators we can pick table-size < 11 (which implies 
table-size i < 9, since over a three- letter alphabet table-sizes are odd). Refer to p. 31 of 
(the "linkages between them" occupy at most 4 \A\ — 2 columns). 

For G 3 n ° d3 (0, 1; Higman's "type III reductions" require that we insert a row correspond- 
ing to a prefix code with 3 leaves in the inner tree. Since one of the pre-existing rows in the 
table already has two leaves, we need the table-size to be large enough so that the Observation 
before Lemma 16.31 applies. If T m (over {0,1}) has depth at least 5, and T m has at least two 
leaves, then it either has 3 (or more leaves) or it has at least two one-child vertices such that 
the depths of these two vertices are equivalent mod 3. In the latter case we apply Lemma IB~3l to 
obtain the desired code. For T in to have depth 5, it is sufficient for the code (over {0, 1}) to have 
cardinality 2 6 . Hence (by Lemma f6.3|) . the code over {0, 1, if) has cardinality 2 x 2 6 — 1 = 127. 
So the presentation of G 3 n ° d3 (0, 1; if) uses tables of size < 127, hence of size < 125 (since code 
sizes over a 3-letter alphabet are odd). □ 



X\ . . . x n 

yi ■■■ y n 



The group ^3^(0, 1; 4f) maps onto G% i by the homomorphism 
X\ ... X n Xi~j^ . . . x m ^ z 

yi ■■■ y n y[# ■■■ y' m # 

whose kernel is the normal subgroup pFix G ({0, 1}*) of G = G3,i(0, 1; #) (by Lemma l4~T1 this 
partial fixator is indeed a group). The group pFix G ({0, 1}*) consists of the elements that have 
a table of the form 

x 1 ... x n x[# ... x' m # 
x 1 ... x n y[# ... y' m # 

Hence, G3,i(0, 1; #) is not a simple group. 

In a similar way, G 3 n ° d3 (0, 1; #) maps onto G = Gf° d3 with kernel pFix G ({0, 1}*); hence 
G 3 11 ° d3 (0, 1; #) is not a simple group. 

In summary, we proved: 

Theorem 6.7 The group G 3 n ° d3 (0, 1; #) is finitely presented, and not simple. 

The word problem o/G 3 n ° d3 (0, 1; #) ; over the infinite generating set A U {r iti+ i : < i}, is 
coNP-hard with respect to constant- arity conjunctive polynomial-time reduction. (Here A is a 
finite generating set o/G 3 n ° d3 (0, 1; if).) 
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7 Reduction to the word problem of a finitely presented 



group 

So far, the word problems that we have focused on were over infinite generating sets, although 
the groups used also admit finite generating sets. This is a crucial point, because the groups 
G31, etc., have their word problem in P over a finite generating set; but their word problem 
over certain infinite generating sets, as seen here, is coNP-hard. 

In this section we obtain different Thompson groups with finite generating sets. These 
groups are obtained by expressing the transpositions r njn+ i over a finite set of generators, 
according to tbe methods of Section 2; we saw that r ni „ + i has polynomial word length (in n) 
over those generators. Thus Section 2 now gives us a finitely generated Thompson group with 
coNP-hard word problem. We will work next at obtaining a finitely presented group. 

Proposition 7.1 Conjugation by K; L or k7 l x (i = 0, 1, 2, 3) is an automorphism of G™° d3 (0, 1; 
and also an automorphism o/G™ d3 (0, 1). 

Proof. It is enough to prove that G™° d3 (0, 1; #) and G™ d3 (0, 1) are closed under conjugation 
by Ki and by kT 1 - 

The definition of Ki directly shows that Ki and kJ 1 stabilize {0, 1}* and {0, Hence for 
every tp G G^ d3 (0, 1), K iV Kj l and k~V«* stabilize {0, 1}*; and for every <p G Gj£? d3 (0, 1; #), 
k^kT 1 and K~ X LpKi stabilize {0, 1}* and {0, 1}*#. 

The definition of k% also directly shows that Ki is length-preserving. Hence, or every tp G 
G™° d3 , k^kT 1 and kJ 1 ^^ preserve length of strings in {0, 1}* modulo 3. 

Thus, all we still need to show is that if <p G G™° d3 (0, 1), then k^kT 1 and K~ x <p)Ki belong 
to G 31 . We will do this by showing that they have "finite depth". An element ip G Qz,\ is said 
to have depth < d iff for all w G {0, 1, #}* with > d, there is a prefix v of w — vs (for some 
s G {0, 1, #}*, with |t> I < d and ip(w) = ip(v) s. Obviously, ip belongs to G 3) i iff tp has finite 
depth. 

Let (p G G 3 n 5 >d3 (0, 1). Since k^k^ 1 stabilizes {0,1}*, it has domain and image codes of 
the form P 1 U {J v€ p 2 v # P(f ), where Pi (c {0, 1}*) is a maximal prefix code over {0, 1}, P 2 
(c {0, 1}*) consists of all strict prefixes of elements of P 1; and each P(f ) is a maximal prefix 
code over {0, 1, In order to show that k^k^ 1 and K~ x <p)Ki belong to G^i, we have to show 
that Pi is finite (hence P2 is finite), and that each P(v) is finite, as v ranges over P2). Let 

m = max{|w| : w G domC((^) U imC(</?)}, 

d = i + 3 • [m/3] (i.e., 3 • [m/3] is "m rounded up to the next multiple of 3"). 
We claim: 

KiLpn~ l and K~ l LpKi have depth < d. 
For w G {0, 1}*, if |tu| > d we can write w = xs G Pi{0, 1}* C {0, 1}*, with \x\ = d. Then 

Ki(xs#) = Ki(x) Kq(s) #, 

by the choice of d, and since |x| = d. Next, applying ip yields 

(p(Ki(x)) k (s) #, 
since = |x| > m. Now, applying k~ 1 yields 

Ki\ip(Ki{x))) Kq\kq{s)) #, 
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since \(f(Ki(x))\ = |«i(x)| (mod 3) = d (mod 3). Important remark: Here we used the fact that 
<f preserves length modulo 3 (on {0, 1}*). 

Thus we have: 

for all xs G {0, 1}* with \xs\ > d, \x\ = d. 

For v^st with v G P2, st G P(v){0, 1}*, and \v#s\ < d we have 

ip(Ki(v#st)) = ip{Ki(v) #st) = ip{Ki(v) #s) t; 

the last equality holds because |f#s| < d. Note that ip(ni(v) #s) contains at least one copy of 
#, since p G G™ d3 (0, 1), i.e., ip{Ki(v)#s) = y#z for some y G {0, 1}*, z G {0, 1, #}*. 

Therefore, when we apply n~ x we obtain 

Kj\ V {Ki{v) #5)) i = ^(y^zt. 

This shows that K^ ipK^ has depth < d. Hence, Pi, P2, and all P(u) are finite. For k^k^ 
the proof is the same. □ 

As a consequence of Proposition 17. II we can consider the following HNN-extension: 

#(0,1; #) = (G^ d3 (0,l;#)U{t} : {tgt^ 1 = g K:i21 : g G G™° d3 (0, 1; #)}). 

Since G™° d3 (0, 1; #) is finitely generated, the HNN- relations form a finite set; moreover, since 
G™° d3 (0, 1; #) is finitely presented, the whole HNN-extension is a finitely presented group. 

This HNN-extension is rather special, since the group being extended is the same as the 
group being conjugated. Therefore, the normal form of elements of the HNN-extension H (0, 1; jf) 
is 

gt n , where n G Z and g G G^ d3 (0, 1; #). 
It follows that this HNN-extension is a semidirect product: 

#(0,1; #) = G£? d3 (0,l;#) x Z. 

Lemma 7.2 TTie HNN-extension #(0, 1; zs isomorphic to the subgroup (G^° d3 (0, 1; U 
}) of the group Q 3:1 . 

Proof: By the normal form theorem for HNN extensions, the mapping defined by t f— > k 
determines a surjective homomorphism from #(0, 1; jf) onto (G™° d3 (0, 1; jf) U {k}). 
Here we abbreviate k 32 i to k. 

In order to show that the map gt n 1 — > gK n has trivial kernel, suppose by contradiction 
that for some n 7^ 0, an element 7? = gn n is the identity. 

Since g G G^i, it has finite domain and finite image codes. Let £ be an upper bound on the 
longest length of any element in the domain code and the image code of g. Let B be an integer 
such that B > 6 |n|, and B > 2£. 

Let x G {0, 1}* be of length > 3B, and let us apply ip to the argument The map 

g G G™° d3 (0, 1; can change at most £ bits of the argument, or shorten or lengthen the 
argument by < I bits. The map K n moves bits over a distance < 6 \n\. Therefore, the effect of g 
on the argument xj^ is only felt on the leftmost B = 2 £ + 6 |n| bits of the argument. Further 
to the right inside x#, only n n has an effect. So we can write x as x = ps with p, s G {0, 1}*, 
\p\ = B; then (p(psjf) has the form 
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<p(ps#) = pV# 

for some p', s' G {0, 1}*, with \p'\ < B. Most importantly, s' is changed (according to K n ) at 
every bit position, except perhaps in the rightmost 6 \n\ (< B) bits. Since we chose |x| > 3B, 
we conclude: (p changes x#. So (p is not the identity map. The completes the proof by 
contradiction. □ 

In summary, so far we have proved the following. 

Theorem 7.3 There exists a finitely presented Thompson group G (c with the following 

properties: 

• The word problem of G (over a fixed finite generating set) is coNP-hard, with respect to 
polynomial-time constant- arity conjunctive reduction. 

• G is an HNN extension (by one stable letter) of some finitely presented subgroup Th of G% \. 
In fact, G is isomorphic to the semidirect product Th x Z. 

An example of such a group G is the subgroup (G*3 n ° d3 (0, 1; jf) U {^321}) of Q31, where Th 
* s G™ d3 (0,l;#). 

Proof. We use k 32 i to replace the transpositions r„ ;n+1 by words over A U {^321} of linear 
length (according to Lemma l2.2j) : here A is a finite generating set of G™° d3 (0, 1; Now 
the previously seen reductions reduce the circuit equivalence problem to the word problem of 
(G^ d3 (0,l;#)U{K 32 i}). □ 

In the next Section we will show that the word problem of (G™° d3 (0, 1; #) U {^321}) is in 
coNP, thus showing that this word problem is coNP-complete. 

8 Complexity of some word problems 

Consider the following subgroups of the Thompson-Higman group Qz,x- 

#(0,1) = (G^ d3 (0,l)U{ K32 i}), 
#(0,1; #) = (G^ d3 (0,l;#)U{K 32 i}), 
(G 3i i U {k 32 i}), and 

(G 3) i U {Ko,«l,«2}) • 

We see from the definition of and K3 that they differ only by a finite permutation; hence 

(G 3j i U {/Co, Kl, «2» = (G 3 ,lU{«o,/Ci,«2,K 3 }). 

Before we analyze the word problem of these groups we need a result about the permutation 
group (70,71,72) (of permutations of N), and about the subgroup {kq, Ki, K2) of Q^,i- For 
7r G (70,71,72) we denote the word-length of n over {7o,7i,72} ±:L by \n\; similarly, for K G 
(k , ki, k 2 ), the word-length of K over {k , Ki Kz}^ is denoted by \K\. 

Lemma 8.1 Let 71 G (70, 71,72)- Then for alln G N with n > 2 \tt\ + 1 : 7r(n + 3) = n(n) +3. 
Hence the displacement function n f— > 7r(n) — n is ultimately periodic, with period 3 ; when 
n > 2|d + I. 
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As a consequence, tc 7^ 1 iff ir(n) 7^ n for some n < 2 \n\ +3. Similarly, for if G (/to, «i, K2) 
we have: if 7^ 1 z/f £/jere exists x G {0, 1}* |x| < 6 \K\ + 3 ; snc/i £/ia£ K{xjf) 7^ 

TTie word problems of the groups (70,71,72) (^0,^1,^2) can fre decided deterministically 
in quadratic time. 

Proof. From the definition of 70, 71, and 72, one sees immediately that 7«(n + 3) = 7i(n) + 3, 
for all n > 3, % = 0, 1, 2. For ix G (70, 71, 72), the relation 7r(n+3) = 7r(n) + 3 (when n > 2 | vr j — I— 1) 
follows by a straightforward induction on \ir\. Indeed, 7jVr(n + 3) = 7j(7r(n) + 3) = / ~f i ir(n) + 3, 
if n > 3 and 7r(n) > 3. Moreover, since each 7, can decrement its argument by at most 2, we 
have n(n) > 3 if n > 2 | -zr j + 3 = 2 |^7r | + I. 

Let 7r G (70,71,72)- To check whether ir — 1, we compute the 2 | vr | + 4 numbers 7r(n) 
with < n < 2 |7r| + 3, and check whether 7r(n) = n. Let it = nj, . . . 7Ti, with 7Tfc, . . . ,7Ti G 

{7o, 7i, 72} ±:L - To compute n(n) we successively compute 717(71), 7r 2 7r 1 (n), , ix,j . . . 7Ti (n) , 

, 7Tfe . . . 7Tj . . . 7Ti(n). For this, all we need is a deterministic push-down automaton, whose 

input tape contains the word (71^, . . . , 717); inputs are read from right to left. After reading 
(■Kj, . . . , 7Ti) with k > j > 1, the machine's stack contains the number 7Tj . . . ni(n) in unary, and 
the machine's internal state remembers 7Tj . . . 77(77) mod 3. To apply itj+i to 7Tj . . . 7Ti(n), the 
machine only needs to know ttj . . . 77(77) mod 3, and it needs to know whether ttj . . . 717(77) is 
equal to 0, 1, 2, or > 2. Since a push-down automaton has linear running time, n{n) can thus 
be computed in time O(n) (< 0(|7r|). Since < n < 2 \rr\ + 3, the total time to compute 7r(0), 
tt(1), tt(2 |tt| + 3) is 0(|tt| 2 ). 

For if G (/to, «i, K2) an d £ {0, the action of if on x# permutes the bits of the 
bitstring x. Note that Ki permutes the bits of x# in the same way as 7" 1 permutes the bit 
positions, except near More generally, when |x| > 2 |if |, the action of if on x# permutes 
the bits of x in the same way as hk G (70, 71, 72), except perhaps for the right-most 2 \K\ bits of 
x (near #); here 7Tr- is obtained from if by replacing every Ki by 7" 1 (i = 0, 1, 2). Indeed, every 
^ in if differs from the corresponding 7" 1 at most on the 2 bits near this effect propagates 
I if I times, to a distance < 2 |if | from 

If if 7^ 1, then either tt^ 7^ 1, or if is a non-identity permutation on the right-most 2 |if | 
positions of some words x# G {0, 1}*#. Note that |7r^| < |if|. When |x| > 6 |if | + 3, the 
action of if on xj^ consists of applying n K on x, except for the right-most 2 |if | bits. Thus, if 
7ix 7^ 1, we can check this on the left-most 4 |7Tk-| +3 ( < 4 |if | + 3) bits of x#; if = 1, we 
can check that if is a non-identity permutation on the right-most 2 |if | positions by inspecting 
these 2 I if I positions. Therefore, if if 7^ 1, there is a position n < 6 |if | + 3 which is permuted 
non-identically by if. Therefore, to decide the word problem for if G (k , ki, k 2 ) we can proceed 
as for (70,71,72), above, but we check how if permutes all n with n < 6 |if| + 3 (instead of 
<4|vr| + 3). □ 

Theorem 8.2 The word problem of (0 3 ,i U {^0,^1,^2}); and hence of H(0, 1), ii(0, 1;#) ; 
and (03,1 U {^321}), are in coNP. 

Proof. Since 3 n ° d3 (0, 1) and 3 n ° d3 (O, 1; are finitely generated subgroups of 03,1, and 
(03,i U {^321}) is a finitely generated subgroup of (0 3 ,i U {k , ki, k 2 }), it is sufficient to show 
that the word problem of (0 3jl U {k , k x , k 2 }) is in coNP. Indeed, it is a general fact that 
if a group's word problem has a complexity < f(n) (regarding time of space, deterministic, 
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nondeterministic, or co-nondeterministic) , then every finitely generated subgroup has a word 
problem of complexity < /(cn), for some positive constant c (see [TH], and (3]). 

Let A3.1 be a finite generating set of Gs^. We will prove (in the Claim below) that if a 
word w over the generating set Af \ U {ko, «i, «2} ±:l is not the identity then there exists a 
word x G {0, 1, #}* of length |x| < c|w| (for some constant c), such that w(x) is defined and 
w(x) 7^ x. 

Therefore, a nondeterministic algorithm for the negated word problem of ((j3,iU{ko> k i> ^2}) 
simply needs to guess x, then compute w(x), then check that 2 7^ w(x). Guessing a; takes linear 
time (since |x| < c |iu|). Applying an element 5 G Af{ to a word z G {0, 1, #}* takes constant 
time (since 5 just changes a bounded-length prefix of z), and changes the length of z by an 
additive constant: \S(z)\ < \z\ + c. Applying nf l (i = 0, 1,2) to z will not change the length 
of z and takes linear time (< c\z\). Finally, since |w(x)| < c\w\ for some constant c), one can 
check in linear time whether x 7^ w(x). So the Theorem will follow from the following Claim. 

CLAIM: Let w G (A3 \ U {k , k±, k^}* 1 )* be such that as an element of C^i, w is not the 
identity. Then there exists x G {0, 1, #}* such that w(x) is defined, x 7^ w(x), and \x\ < c \w\. 

Proof of the Claim: Let I be the length of the longest word in the domain and image codes 
of the elements of A3.1. 

The word w is of the form w = g n K n g n _iK n _i ■ ■ ■ giK^o, where g n , . . . , gt, g G (A^)*, 
and K n , . . . , Ki G ({kq, «i, 1^2}^)* ■ Since w does not represent the identity, there exists a 
word z G {0,1, #}* such that z 7^ 10(2;). We can assume that z is long enough (indeed, 
w(zZ) = w(z) Z 7^ zZ for any word Z G {0, 1, #}*; so we could replace z by and thus make 
z as long as we wish). So we can assume that \z\> 3N, where N = £ J2j=o 1^1 + 6 J2j=i 
(< (£ + 6) \w\). Let pqr be the prefix of length 3iV of z, where \p\ = \q\ = \r\. We will show 
that x = pqr^ satisfies w{x) 7^ x. 

The first (i.e., the right-most) generator in go affects only the left-most i letters of z. Since 
the right-most letter in g could shorten z by up to I — 1, the right-most two letters of go could 
affect at most the first 2£ letters of z. In total, go can affect the left-most i \g§\ (or fewer) letters 
of z. 

Next, K\ moves each bit of go(z) over a distance < 6 \Ki\. So, Kxg changes the left-most 
6 \K\\ +£ \go\ (or fewer) letters of z (in ways that we will not try to specify). The letters further 
to the right in go(z) (at positions > 6 \K\\ + £ \go\) are permuted by K\ iff # does not appear 
within the left-most 6 \K\\ +£ \g \ positions of go(z). Note that since w(z) is defined, go(z) must 
contain some # (otherwise, K\ would not be defined on go(z)). 

For the same reason, w changes the left-most iV = 6 Y^=i l-^il + ^Y^=o ( or f ewer ) 
letters of z in fairly arbitrary ways; those are the positions in the prefix p of z. The letters 
further to the right in z (at positions > N) are only permuted according to some of the X m 's 
(n > m > 1), namely for those m for which g m ^\K m _\ . . . g\K\g^z) does not contain # within 
the iV leftmost positions. Let K G {kq, k 1? k 2 ) be the concatenation of those K m (m = n, . . . , 1) 
for which there is no # in g m _\K m _\ . . . gxK\go(z) within the iV leftmost positions. 

Since w changes z, it either changes the prefix p of z, and in that case, w will of course also 
change x = pqrj^. Or w does not change the prefix p, but K permutes bits at positions > iV 
in z, non-identically. Moreover, by Lemma f8. 11 if K acts non-identically at a position i + 3 of 
z, with N < i, and \z\ > N + 4 \K\, then K also acts non-identically on position i of z. Thus, 
acts non-identically on a position i of z, with iV + 3 < i > N. Then w changes p, hence x. This 
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proves the Claim, and hence the Theorem. □ 

The main theorem of the previous section now becomes: 

Theorem 8.3 There exists a finitely presented Thompson group G (c Gs,i), with the following 
properties: 

• The word problem of G (over a fixed finite generating set) is coNP-complete (with respect to 
polynomial-time constant- arity conjunctive reduction). 

• G is an HNN extension (by one stable letter) of some finitely presented subgroup Th of ' G^\. 
In fact, G is isomorphic to the semidirect product Th x Z. 

An example of such group G is the subgroup (G!f° d3 (0, 1; #) U {^321}) of Gz,i, where Th 
zsG^ d3 (0,l;#). 

Proof. This follows directly bv combining Theorems 17.31 and 18.21 □ 

Next we give a coNP-completeness result about finitely generated simple groups. First, 
recall the following: If Gn,i Q G C Q N1 then the commutator subgroup G' is a simple group 
(see R. Thompson's comment before Corollary 1.11 in [ST]; an actual proof of this claim and a 
generalization to the Thompson-Higman groups (jjv,i was given by E. Scott, Lemma 20 in |27j). 
Note the symbols "C" in the result; it is not sufficient that G contains a copy of Gjv,i and Gn,i 
contains a copy of G, but the copy of Gn,i inside G must be identical with the subgroup Gn,i 
of Q N) i. 

When if is a subgroup of a group of G, recall the Reidemeister- Schreier rewrite process (see 
e.g., [TH] pp. 90-93, [T7j pp. 102-104, [23] pp. 69-78). The graphical form of the process is quite 
intuitive. One first takes the Schreier graph, whose vertex set is the set of cosets Hgi, where 
<7i (i — 1, . . . , k) are coset representatives (we only use the case when k is finite). The set of 
(labeled) edges of the Schreier graph is {Hgi — > Hgia : a G A, % = 1, . . . , k}, where A is a 
generating set of G. We will only consider the case when A is finite. 

Hence, when G is finitely generated and H has finite index in G then the Schreier graph 
is a finite automaton (if we pick the coset H as both start and accept state), which decides 
the generalized word problem of H in G (deterministically in linear time). We also have the 
following interesting properties, assuming H has finite index in G: If G is finitely generated 
then H is finitely generated; if G is finitely presented then H is finitely presented (see the 
above references). Moreover, the Reidemeister-Schreier rewrite process shows that when G is 
generated and H has finite index in G then the distortion of H in G is linear. 

Let us pick a spanning tree in the Schreier graph, with root H; this is the graphical way of 
choosing a Schreier transversal: for every vertex Hgi let t{ G (A^ 1 )* be the label of the path 
in the spanning tree from the root H to Hgf, then the word U represents an element of Hgi, 
so we can write Ht t for Hg{, let T = {ti : i = 1, ... , k}. For any word w G (A ±1 )*, we denote 
the coset representative of w by w (g T). The following set, called the Reidemeister-Schreier 
generators, generates H: R = {ti a (t^a) -1 : t; L a G" T ', a G A^, i = 1, . . . , k}. 

We need an auxiliary result: 

Proposition 8.4 Suppose G is a finitely generated group, and H is a subgroup of G of finite 
index. Then the word problems of G and H are reducible to each other by linear-time many-to- 
one reductions. 
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Proof. Recall that by the Reidemeister-Schreier rewrite process, H is finitely generated; let 
R be a finite generating set of H. Hence, the identity embedding of H into G is a one-to-one 
reduction of the word problem of H to the word problem of G. The reduction just consists of 
expressing each generator in R by a string over A in a fixed way, so this reduction has linear 
time complexity 

Conversely, let us reduce the word problem of G to the word problem of H. Let ho be some 
fixed word over the such that h ^ 1 in H. A function that reduces the word problem of G to 
the word problem of H can be defined by 



Here, (w)r denotes the expression of w over the Reidemeister-Schreier generating set R of H 
(when w G H). By the Reidemeister-Schreier rewrite process, (w)r can be obtained from w in 
linear time. Since the generalized word problem of H in G is decidable in linear time (using 
the Schreier graph automaton), it follows that the above reduction function is computable in 
linear time. Finally, w — 1 in G iff f(w) = 1 in H. □ 

Theorem 8.5 There exists a finitely generated simple Thompson group whose word problem 
is coNP-complete (with respect to polynomial-time constant- arity conjunctive reduction). 

An example of such a group is (G 3i i U {k , Ki, k 2 })' , i.e., the commutator subgroup of 

(G 3jl U {/Co,Ki,«2}). 

Proof. The group G = (G^iU-fKo, «i, ^2}) satisfies 6*3,1 CGC Q 31 . We immediately conclude 
that the commutator subgroup G' is a simple group, by the earlier remarks on R. Thompson's 
comments. Also, G' has finite index in G. Indeed, 1 , and G' 3 x has index 

2 in G 3 i (by [Ej). Clearly, G is finitely generated (since the Thompson-Higman group G 3i i 
is finitely generated). It follows that G' is finitely generated, by our remarks above on the 
Reidemeister-Schreier rewrite process. 

By Proposition 18.41 the word problems of G and G' are reducible to each other. Hence, since 
the word problems of G is coNP-complete, the word problems of G' is also coNP-complete. □ 

We have now completed the proofs of the main theorems, which give us finitely presented 
Thompson groups, and finitely generated simple Thompson groups with coNP-complete word 
problems. To finish, let us give some more explanations of the fact that the finitely presented 
group G 3j i (over a finite set of generators) has a word problem in P (deterministic polynomial 
time), but over an infinite set of generators (obtained by including all letter transpositions) the 
word problem of G 3t i is coNP-complete. This is related to the concept of distortion. See [!"">] 
for the original definition by Gromov, and |21j . [2*2"] for a slightly more natural definition and 
some interesting results; results on distortion in Thompson groups appear in [""]; the complexity 
version of the Higman embedding theorem (in [""] for semigroups and [21"" , [7] for groups) show 
that the embeddings given there have linear distortion. 

Originally, Gromov only defined distortion to characterize the relation between a group and 
a subgroup. In the present context it is useful to also consider the self-distortion of a group, 
relative to different generating sets. In the case of G 3i i we consider a finite generating set 
A 31 and the infinite generating set A 31 U {t^+i : % > 0} (with infinitely many transpositions 
included in the generating set). The self- distortion of G 3>1 relative to B — A 31 U {r iii+1 : i > 0} 





if w G H. 
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is said to have upper bound / iff / : N — > N is a non- decreasing function such that for every 
g G G 3t i we have: \g\& < /(|^|b)- Here, \g\& denotes the word-length of g over the generating 
set A, i.e., the length of the shortest word in (A )* representing g (and similarly for |o|,b). 
The next Lemma shows that the self-distortion of G^i for the above generating set is at least 
exponential. The self-distortion of G^\ for the above generating set is closely related to the 
Gromov distortion of G 3 i within (G 3) i U {^321})- 

Definition. Two functions f\, f'2 '■ N — > N are said to be linearly equivalent iff there 
exist positive constants c , ci, c 2 , c 3 , c 4 suc/i i/iai /or a// n > c : fi(n) < c i/2(c2^) awe? 
/2O) < c 3 /i(c 4 n). 

^4 function f : N — > N at least exponential z/f t/iere zs a constant c > 1 suc/i i/ia£ /or 
infinitely many n: f(n) > c n . 

Lemma 8.6 The self- distortion of G 3) i relative to the generating set A 31 U {t^+i : i > 0} is 
at least exponential. The distortion of G 3) i in (G 3i i U {k 32 i}) is at least exponential. Similarly, 
the distortion o/G 3 n ° d3 (0, 1; jf) in H(0, 1; zs at /east exponential. 

Proof. We saw already in Lemma 12.21 that the transposition r n _i jn (where n > 0) has word 
length |r njn+ i| < 2n — 1 in (G 3) i U {/t 3 2i}), over any generating set containing 71,2 and K 3 2i. 
Moreover, we will prove next that the transpositions have exponential table size; this implies 
an exponential word length over any fixed finite generating set of C7 3jl , as we will see. 

Claim. The table-size of the maximum extension of r„_ l n is ||T n -i,n.|| = 2 n+2 — 1. 
Proof of the Claim: The domain and image code of r n _ Ln , as originally defined, are both 
equal to {0, l} n+1 U {0, However, in order to find n || we must maximally extend 

T n _i n . Recall that a bijection <p between finite maximal prefix codes is extendable iff there 
exist u, v G {0, 1, #}* such that domC(y?) contains the triple uO, ul,u# and imC((^) contains 
the triple vQ,vl,v# with ip(u0) = vO, f(ul) = vl, ip(u#) = v#. 

A triple of arguments in the domain code of r n _i n that could possibly lead to an extension 
is of the form 

where x , . . . , x n _ 2 , a? n _i G {0, 1}. The transposition r n _i >n maps this triple to the triple 

So, whether x n -\ = 1 or x n -i = 0, no extension is possible. The set {0, l} n+1 u {0, l}- n # has 
cardinality 2 n+1 + 2 n+1 — 1. This proves the Claim. 

Now, by the relation c A • ||T nn+1 || < |r„ jn+1 | A (Corollary 4.7 in [§]) for some constant c A > 0, 
depending on the choice of a finite generating set A 3 i chosen for G 3j i: 

It- \ ~> r Ht I > r 0"-+2 „ 

|'n— l,n|A — a II i—l, n|| — l, a a 

Hence, since 2n — 1 > |r„_ l n | AUK (as we already saw at the beginning of this proof), 

|r n -l,„| A > C A - 25I t «-i."Iau k + I -c A . □ 

Theorem 8.7 The distortion of G 3j i in (G 3j i U {K321}) is exponential (i.e., it is linearly 
equivalent to 2 n ). 

Similarly, the distortion o/G 3 n ° d3 (0, 1) in H(0, 1) or in (G 3) i U {^ 3 2i}) is exponential. And 
the distortion o/C7™° d3 (0, 1; #) in H(0, 1; or m H(0, 1) or m (G 31 U {k 32 i}) exponential. 
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Proof. We already saw an exponential lower bound, in Lemma 18.61 We will now prove an 
exponential upper bound, of the form c n (for some constant c > 1). 

Let A 3) i be a fixed finite generating set for G 3i i. For cp G (j3,i, let |<£>|a 3 ,i denote the 
word-length of (p over the generating set A (i.e., the length of a shortest word over Af\ that 
represents <p). Similarly, |v?|a 3 ,i,k denotes the word-length of <p over A 3ji U {k} (i.e., the length 
of a shortest word over Af\ U {k ±1 } that represents <p). 

Claim: Let w be a word over A^} U {k ±1 } that represents an element of G 3j i, and assume 
that w is in shortest form (i.e., there is no shorter word over A 3 ;| U {k ±1 }, representing the 
same group element). Then the longest entry in the table of <p has length < (6 + £) \w\. 

Proof of the Claim: Let w = g n ^ %n Qn-i^' 1 4 4 4 9i^ n 9oi where i n , ■ ■ ■ ,ii G Z — {0}, and 
9n, 4 4 4 , 9i,9o e (Ag)*. 

As in the proof of Theorem 18 .2[ let x G {0, 1, #}* be any word of length at least 3N, where 
N = £ Y^=o + 6 YTj=i \h\ (— + 6) \w\), and where £ is the length of the longest word in 
the domain and image codes of the elements of A^i. In the proof of Theorem 18.21 we saw that 
the action of w on x changes the left-most N (or fewer) letters of x in fairly arbitrary ways. The 
letters of x at positions further to the right (i.e., at positions > N) are only permuted according 
to n lsum( - x \ We have i sum (x) = 0, otherwise w would change bits at arbitrarily remote positions 
on x (for arbitrarily long words x; this would imply that w has an infinite table (contradicting 
the assumption that w represents an element of G 3i i). 

Now, since ^ sum (a;) = 0, w only changes letters at positions < N (< (6 + £) \w\) in x. 
Therefore, the longest word in the domain code of w has length < (6 + £) \w\. This proves the 
Claim. 

It follows immediately from the Claim that the table size of (^satisfies \\<p\\ < 3 ((m)H . Note 
that here, w is the word length of <p over the generating set A 3i i U {k}; i.e., \w\ = |v?|a 3i i,k- By 
Theorem 4.8 in |HJ, lvL 31 — c a ' IMI ' l°g2 llv 9 !!) where c A > is a constant. Hence, 

\<p\ A < c3 c ^«c \<p\ A ,K < C^-. 
for some constants c,C > 1. This proves the Theorem. □ 
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9 Appendix 



9.1 Properties of prefix codes 

In this appendix we prove various properties of prefix codes that are used in the paper. Recall 
that e denotes the empty word. 

Lemma 9.1 (Lemma 14. 7|) 

(1) If P C {0, 1}* U {0, 1}*# is a maximal prefix code over {0, 1, #} then P = P x U P 2 # 
for some Pi, P 2 C {0, 1}*, with the following properties: 

• P\ is a maximal prefix code over {0, 1}; 

• P2 = {p G {0, 1}* : p is a strict prefix of some element of Pi}. 
When Pi is finite, this last property implies: |P 2 | = IP I — 1- 

(2) Conversely, if P = Pi U P 2 # for some Pi,P2 C {0, 1}* with the above two properties, 
then P is a maximal prefix code over {0, 1, jf}. 

Proof. If P C {0, 1}* U {0, 1}*# is a maximal prefix code then P has the form P = P 1 U P 2 #, 
with Pi,P 2 C {0,1}*. Since P is a maximal prefix code, P x is a maximal prefix code over 
{0, 1}. Also, the set P 2 # is a prefix code for any subset P 2 C {0, 1}* (since any two elements 
P2# 7^ £>3# with p 2 ,p3 G {0, 1}* are prefix incomparable). 

Let us prove that P 2 is as in the Lemma. Since Pi is a maximal prefix code, every p 2 G P 2 
(and in fact every string in {0, 1}*) is prefix comparable with some element of Pi. Let's say, 
Pi G Pi is prefix comparable with p 2 G P 2 . If pi were a prefix of p 2 then pi would also be a 
strict prefix of p 2 #, which would contradict the fact that P is a prefix code. This shows that 
every element of P 2 is strict prefix of an element of Pi. Since P is a maximal prefix code, P 2 
consists of all strict prefixes of elements of Pi. 

It is straightforward to prove the converse, namely that every set P = Pi U P 2 #, with Pi, P 2 
as above, is a maximal prefix code. □ 

Lemma 9.2 Every maximal prefix code over the alphabet {0, 1, can be written in the form 
Pi U [J ve p 2 v#P(v), for some P U P 2 C {0,1}* and P(v) C {0,1,#}* ; with the following 
properties: 

• Pi is a maximal prefix code over {0, 1}; 

• P 2 = {p G {0, 1}* : p is a strict prefix of some element of Pi}. 
Hence, when Pi is finite, this last property implies: |P 2 | = |Pi| — 1. 

• For every v G P 2; the set P(v) is a maximal prefix code over {0, 1, #}. 

Conversely, if P = Pi U [J ve p 2 v# P{v) for some P\, P 2 and P(v) with the above three 
properties, then P is a maximal prefix code over {0, 1, jf}. 

Proof. The proof is straightforward, and similar to the proof of Lemma f4. 71 □ 

Lemma 9.3 Let x,y,u,v G A*. If xu and yv are prefix- comparable then x and y are prefix- 
comparable. Contrapositively , if x and y are prefix incomparable then xu and yv are prefix 
incomparable. 
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Proof. The proof is straightforward. □ 

Lemma 9.4 If P C A* is a maximal prefix code and if x G A* , then x P = {w G A* : xw G 
P} is either empty or a maximal prefix code. Equivalently, if PA* is an essential right ideal 
then x~ 1 PA* is also an essential right ideal. 

Proof. This is a classical property of maximal prefix codes (see e.g. Lemma 8.4 in jH]). □ 

Lemma 9.5 // a maximal prefix code over an alphabet A, \A\ > 2, contains a word of length 
n then this prefix code has cardinality at least n + 1 . 

Proof. This is a classical property of maximal prefix codes (see e.g. □ 
Lemma 9.6 (Lemma 15. 8|) 

Let P C A* be any prefix code, where \A\ = n > 2. Assume ip G pStabg n (PA*), but ip 
pFixg (PA*). Then there exists x G PA* such that x and (p(x) are not prefix- comparable. 

In particular, if ip G Q n ^ is not the identity element then there exists x G domC(v?) such 
that x and (p(x) are not prefix- comparable. 

Proof. Let P' be another prefix code such that P and P' complementary prefix codes (P' = 
if P is a maximal prefix code). Let ip be the restriction of ip to the essential right ideal (PUP') A*. 
So ip is a right-ideal isomorphism that represents <p. 

We will prove the contrapositive of the Lemma: Assume that x and ip(x) are prefix- 
comparable for all x G domC(^) fl PA* , and that ip and ip~ x stabilize PA* where ip and ip~ l 
are defined. Then the restriction of ip to domC(^) D PA* is the identity map. 

Case 1: x >p rc f ip(x) (i.e., x is a strict prefix of ip(x)), for some x G domC(^) fl PA*. 

Then ip(x) = xv for some v G AA*, and ip(x) = xv G imC('0) fl xAA*. By Lemma l9.4( 
s _1 (imC(?/') fl xAA*) is a maximal prefix code. Since it contains the non-empty word v, 
it contains at least two elements (by Lemma I9.5J) . Hence imC(ip) fl xAA* contains at least 
two elements. So there exists x' G domC(V') such that ip(x') ^ ip(x) and ip(x') = xw G 
imC(^) fl xAA* (for some w G AA*). Since ip ±l stabilizes PA* we also have x' G PA*. Since 
imC(^) is a prefix code, the inequality ip(x') ^ ip(x) implies that ip(x') and ip(x) are not 
prefix-comparable. 

By the assumption of the Lemma (or its contrapositive): x' and ip(x') are prefix-comparable. 
Hence we have two possibilities: 

(1) x' < pre f ip(x')\ then x' < pre f ip(x') < pre f x, so x' < pre f x, which contradicts the fact that 
domC('0) is a prefix code. 

(2) x' > P ref ip(x'): then x' > pre f ip(x') = x'z for some z G AA*, and ip(x') = xw (as seen above). 
Hence x'z = xw, which implies that x' and x are prefix-comparable; again, this contradicts the 
fact that domC(^) is a prefix code. 

We conclude that case 1 is impossible. 

Case 2: x < pre f ip(x) for some x G domC(^) fl PA* . 

Then x = ip(x) u for some u G AA*; so, x G domC(^) fl ip(x) AA*). Moreover, by Lemma 
19 .4[ ip(x)~ 1 (domC(ip) fl ip(x) AA*) is a maximal prefix code. Since it contains the non-empty 
word ii, it contains at least two elements (by Lemma r9.5|) : hence, domC(^) fl^^) AA* contains 
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at least two elements. Therefore, there is x' 7^ x with x' G domC('0) D ip{x) AA* . Moreover, 
since ip ±l stabilizes PA*, we have x' G PA*. 

By the assumption of the Lemma (or its contrapositive): x' and ip{x') are prefix-comparable. 
Hence we have two possibilities: 

(1) %' > P ref ip{x')\ then ip(x') < pre f x' and x' < prc f V'(x) (since x' G ^(a^AA*). Therefore, 
■0(a/) < pre f which contradicts the fact that imC(-?/>) is a prefix code. 

(2) x' < pre f if)(x'): then ip(x') > pro f x' = ip(x f ) z for some z G AA*; and x' = if)(x) w for some 
u? G AA* (since x' G ip(x) AA*). Therefore, x' = ^(^O 2 = VK 2 -) ^ which implies that ip(x') 
and i/j(x) are prefix-comparable; again, this contradicts the fact that imC(^) is a prefix code. 

We conclude that case 2 is impossible. Now, having ruled out cases 1 and 2, the only 
remaining possibility is that x = ip(x). □ 

It is well known (and easy to prove) that every finite maximal prefix code P over an alphabet 
A has cardinality \P\ = 1 + (|A| — l)ip, where ip is the number of internal vertices of the 
prefix tree of P. Also, for every integer % > 0, there exists a maximal prefix code P over an 
alphabet A of cardinality 1 + (|A| — 1) i. 

Lemma 9.7 (Lemma 15. 9|) 

Suppose P,P f C A* are complementary finite prefix codes. Let Xi, . . . ,x& G PA*, and assume 
x\, . . ., Xfe are two-by-two prefix-incomparable. Then for all n of the form n = 1 + i (|A| — 1), 
TOi/i n > \P\ — k + (|A| — 1) (|xi| + . . . + |xfc|), there exists a prefix code Q such that 

• Q U {xi, . . . , Xfc} ana" P' are complementary prefix codes, with Q U {xi, . . . , x^} C PA*; 

• \Q\=n. 

• The set of prefixes of P is a subset of the set of prefixes of QU {xi, . . . , x^}. 

Proof. Since x, G PA* we can write Xj = for some Pi G P and G A* (i = 1, . . . , k). 
Moreover, pi and Ui are uniquely determined by Xj (since P is a prefix code). 

We will prove the Lemma only when k = 2, for clarity; the proof for general k is very similar. 

Let U\ = Cic 2 . . . c m and u 2 = d\d 2 ■ ■ ■ d n . In this Lemma we define a = A — {a}, for all 
a G A. We define 

Qo = (P-{pi,p 2 }) U Pi (ci U c{c 2 U C1C2C3 U U Cic 2 . . . c m _ic m ) 

U P2 (rfi U G?i<i2 U d\d 2 d 3 U U did 2 . . . d n _id n ). 

In the special case where X\ = p\, we let 

Qo = (P - {P2}) U p 2 (di U did 2 U did 2 d 3 U U d x d 2 . . . d n _i~d n ). 

and similarly if X2 = p 2 - Moreover, if both x\ = p\ and X2 = p 2 , we simply let Qo = P. 

The formula for Q implies directly that |Q | = \P\-2+ (\ui\ + \u 2 \)(\A\ - 1) < |P| - 2 + 
(|x 1 | + |x 2 |)(|A|-l). 

Intuitively, we can picture the set pi {c\ U Cic 2 U Cic 2 c 3 U . . . U c\c 2 . . . c m _iC TO ) U {xi} 
on the prefix tree of A*: Consider the path labeled by P\U\ = X\, and consider all the vertices 
attached to this path, but not on the path. The set above consists of these attached vertices, 
but excluding the prefixes of pi (i.e., we start after pi), and excluding the leaves of x\. E.g., if 
A = {a,b}, and x% = paaba, then the set is {pb,pab,paaa,paabb,paaba}. For x 2 , the intuition 
is similar. 
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From the prefix tree picture it is obvious that Qo U {xi,X2} U P' is a maximal prefix code, 
since it consists of the set of leaves of a subtree of A*. 

Finally, if we want a prefix code Q as in the Lemma, with cardinality exactly n, we simply 
take Qo and repeatedly replace some leaf in the set p\ {c~i, C\C?, C1C2C3, . . . , C\c? . . . c m -ic m } 
by its \A\ children; each such step increases \Q\ by \A\ — 1, while preserving the fact that 
Q U {xi, X2} U P' is a maximal prefix code, and Q C PA* . 

The fact that the set of prefixes of P is a subset of the set of prefixes of Q U {xi, . . . , Xk} 
follows immediately from the fact that Q U {xi, . . . , Xk} C PA* □ 

Lemma 9.8 (Lemma 15. lip . 

(1) For all x, y G {0,1}* there exist letters l\,l? G {0,1} such that xl\, and yti are prefix 
incomparable. 

(2) For all x,y,z G {0,1}* there exist letters Ix,...,£q G {0,1} such that xl\t%, ytit±, and 
zi^is, are prefix incomparable. 

Proof. (1) If x, y are prefix incomparable, then any l\, I2 G {0, 1} will work, by Lemma f9. 31 
If x = y,then xO and yl are prefix incomparable. 

So let's suppose x is a strict prefix of y. Then either xO is a prefix of y (and then xl is prefix 
incomparable with y and yl?), or xl is a prefix of y (and then xO is prefix incomparable with 
y and y£ 2 )- 

The case where y is a strict prefix of x is very similar to the previous case. 

(2) By (1) there are letters £±,£2 G {0, 1} such that x£\ and yt?, are prefix incomparable. Then 
by Lemma EPl xlil%, ^2^4 are prefix incomparable, for any £3,^4. 

If z is prefix incomparable with both x£\ and yl-i then any choice of £3, . . ., £$ will work, by 
Lemma 19.31 

If z is a common prefix of both x£\ and y£2 then either zO or zl will be prefix incomparable 
with both x£\ and y£2\ then any choice of £3, £4, and e^e will work. 

The only remaining cases are when z is prefix incomparable with exactly one of xl\ and yl?,. 
Let's say, z is prefix incomparable with yl?. Then by (1), z£s and XI1I3 are prefix incomparable 
for some eZ/5, £3 G {0, 1}. Then by Lemma [9.31 yl?l4 and zl 5 are also prefix incomparable, as 
well as yl?li and zl 5 l 6 . □ 

Lemma 9.9 (Lemma 16. ip . 

(0) Every finite maximal prefix code P over an alphabet A ( e.g., A = {0, 1, #} ) has cardinality 
|P| = 1 + (\A\ — 1) ip, where ip is the number of inner vertices of the prefix tree of P. 

If \P\ > 1 then P contains a subset of the form uA (for some word u G A*). 
Also, for every integer i > 0, there exists a maximal prefix code P over an alphabet A of 
cardinality 1 + (\A\ — 1) i. 

(1) If P C {0, 1}* {e, #} ; and \P\ > 1, then P contains a subset of the form 

u {0, 1,#}, for some u G {0,1}* 

(2) For every integer i > 3 there is a maximal prefix code P C {0, 1}* {e, #}, with \P\ = l+2i, 
and with the following property: 

P contains a subset of the form {u, v} {0, 1, for some u, v G {0, 1}*, u/d. 
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(3) For every integer i > 5, there is a maximal prefix code P C {0, 1}* {e, with \P\ = 
1 + 2i, and with the following property: 

P contains a subset of the form {u, v, w} {0, 1, for some u,v,w G {0, 1}*, 
with u,v,w distinct two-by-two. 

Proof. Property (0) is well known (see e.g., |14j . [3]). 

For property (1), recall Lemma \4. 71 about maximal prefix codes C {0, 1}* {e, If \P\ > 1 
then \Pi\ > 1. Any maximal prefix code Pi over {0, 1} with \Pi\ > 1 contains a subset of the 
form u {0, 1} (for some u G {0, 1}*). Hence u G P2, and P will also contain uj^. 

For property (2), let P\ be a maximal prefix code over {0, 1}, such that {u, v} {0, 1} C Pi, 
for some words u, v G {0, 1}* with u 7^ v. For any n > 4, such a Pi exists with cardinality 
|Pi| = n. This is folklore knowledge on prefix codes. One can prove it, e.g., by looking at the 
inner tree of the tree of a maximal prefix code. One takes an inner tree with 2 leaves (it suffices 
for the inner tree to have 3 vertices, arranged in the shape A ). Then among the vertices of 
the tree of the maximal prefix code there will be 2 vertices, each of which has 2 leaves. 

Recall that Pi determines P2 (and IP2I = |Pi| — 1 = n — 1), thus for any n > 4 we obtain a 
maximal prefix code P = PiUp2# of cardinality |P| = n + n— 1 = l + 2i, with i = n — 1 > 3. 

Property (3) is proved in a similar way as property (2). We take an inner tree with 3 leaves 
(it suffices for the inner tree to have 5 vertices), in the shape 

A 
A 

This proves the Lemma. □ 
Lemma 9.10 (Lemma 16. 3|) . 

• Suppose that there exists a maximal prefix code Q over the alphabet {0, 1}, whose inner tree 
Ti n (Q) has two one-child vertices at depths = i mod 3 (for some i G {0,1,2}). Then there 
exists a maximal prefix code P C {0, 1}* with the same mod 3 cardinality as Q, and with the 
following property: 

there is a word u G {0, 1}* such that u ■ {0, 1} C P and \u\ = i mod 3. 

Equivalently, the inner tree of the prefix code P has a leaf at depth = i mod 3. 

• More generally, let k > 2, let i 1 , . . . ,i k G {0, 1, 2}, and suppose that T in (Q) has the following 
property: For every A (1 < X < k), T- m (Q) has a leaf of depth = i\ or it has two one-child 
vertices at depths = i\ mod 3. 

Then there exists a maximal prefix code P C {0, 1}* with the same mod 3 cardinality as Q, 
and with the following property: 

there are k different words u±, . . . , u k G {0, 1}* such that {ui, . . . , u k } ■ {0, 1}CP and 

\u±\ = i\, ... , \uk\=ik, mod 3. 
Equivalently, the inner tree of the prefix code P has at least k leaves that have depths respectively 
= i\, ■ ■ ., = ik mod 3. 

Proof. We start with the maximal prefix code Q and we transform it into a maximal prefix 
code P that has the required properties. The transformations consist of rearrangements of the 
existing vertices of T m {Q). 
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Let us look at two one-child vertices A, B of T in (Q), at depths = i\ mod 3. If A and B are 
not on a common path from the root, we transform T in (Q) as follows: 

\ \ 
/ . . . B / . . . B 

A \ A / \ 

/ z becomes w z 

w STz STw STz 

STw 



Here, w is the child of A and z is the child of B. Moreover, STw is the subtree with root w and 
STz is the subtree with root z. We moved the subtree STw with its root w to the unoccupied 
child position of B. After the transformation, vertex A is a leaf of depth = i\ mod 3. Since the 
depths of A and B are equivalent mod 3, the above transformation preserves the depth mod 3 
of all nodes. 

If A and B are on a common path from the root, we transform T m {Q) as follows: 

/ / 
B B 
/ / \ 

w 
STw 

. \ . \ 

/ becomes / 

A A 

/ 

w 
STw 



Again, after the transformation, vertex A is a leaf of depth = i x mod 3. The depths (mod 3) 
of all nodes are unchanged. 

The proof in the general case is very similar, and can be done by induction. If T m already 
has some leaves (either present initially or obtained by transformations as above) the additional 
transformations don't remove these leaves and don't change their depths mod 3. □ 
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9.2 Commutation test for C7 M and G™f 3 

We show in this subsection that the commutation test works for some fixators in G Ui i and 
G™° d3 . The definitions and proofs are simpler that for the case of the group G™° d3 (0, 1; jf) 
studied in Section 5. The rest of this paper does not depend on this subsection. 
For the finite alphabet A below we assume \A\ > 2. 

Definition 9.11 Let G C Q nl (i.e., G is a subgroup with a particular embedding), and let P, 
P' be complementary prefix codes over an alphabet A, with \ A\ = n. 

The fixator pFix G (P'A*) is separating on PA* iff for any ordered pair (x,y) of prefix- 
incomparable words x, y G PA* there exists h G pFix G (P'A*) and there exists u G A* such 
that 

h(xu) = xu and h(yu) ^ yu. 

Lemma 9.12 Let G C Q n l and let P , P' be complementary prefix codes over an alphabet A. 
If a fixator pFix G (P'v4*) is separating on PA* then it is a maximal fixator. 

Proof. Recall the definition of maximal fixator from Section 5. Suppose by contradiction 
that for some yo £ -P^4* we have for all h G pFix G (P'v4*): h(y ) = y Q . However, x = y§a and 
y = yob are prefix- incomparable (for two letters a ^ b G A). Hence, by the separating property, 
ho(yv>o) 7^ yu for some h G pFix G (P'A*) and some u G A*. On the other hand, h (y ) = y 
implies h (yu ) = h (y buo) = h (y ) bu = yobu = yu . Now we have both h (yu ) ^ yu and 
h (yu ) = yu . □ 

Proposition 9.13 Let G C Q n .\ be a subgroup (with an embedding), and let P , P' be comple- 
mentary prefix codes over A, with n = \A\. 

(1) For all g G pFix G (PA*) and all h G pFix G (P'A*): gh = hg. 

(2) Suppose pFix G (P' A*) is separating on PA* . Then we have for every g G G: 

if gh = hg for all h G pFix G (P'A*), then g <E pFix G (PA*). 

Proof. (1) is straightforward. To prove (2), suppose g G G commutes with all h G 
pFix G (P'A*). Then g~ l also commutes with all h G pFix G (P'A*). 

Claim: g stabilizes PA* and P'A*, where defined. 

Proof of the Claim: Assume, by contradiction, that g(x') = y for some x' G P'A*, y G PA*. 
Then we have for all h G pFix G (P'A*): hg(x') = gh(x') = g(x'). So, all of pFix G (P'A*) 
fixes y G PA*. This contradicts the maximality of pFix G (P'v4*), and hence it contradicts the 
separation property, by Lemma f9. 121 Therefore, g maps P'A* into P'A*. Similarly, g' 1 maps 
P'A* into P'A*. 

If we had g(x) = y' for some x G PA*, y' G P'A*, then g^ 1 (y') = x, contradicting the fact 
that g~ l maps P'A* into P'A*. Thus, g maps PA* into PA*. This proves the Claim. 

To prove that g G pFix G (PA*) , assume by contradiction that g{x\) = y± ^ X\, for some 
X\ G PA*; by the Claim, y\ G PA*. By Lemma 15.81 there exist therefore x, y G PA* such 
that g{x) = y and x and y are prefix-incomparable. Now we have for all h G pFix G (P'A*): 
gh(x) = h(y). Hence for all u G A*, gh(xu) = h(yu). 



54 



But by the separating assumption, there exists ho £ pFix G (P' A*) and there exists w £ A* 
such that ho(yuo) ^ yuo and h (xuo) = xuo] the latter, together with gho(xuo) = ha(yuo), 
proved above, implies (g(xuo) =) yuo = h (yu ). Now we have both h (yuo) ^ yu and 
yu = h (yuo), a contradiction. □ 

Proposition 9.14 Let P, P' C A* be finite non-empty complementary prefix codes with \A\ — 
n > 2. Then pFix G (P' A*) is separating on PA* for the following groups taken for G: 

(1) G = G Uj i, 

(2) G = G%?*. 

Proof. (1) Let x, y £ PA* be two prefix-incomparable words. Let a ^ b £ A (any two 
different letters); note that this makes the words x,ya,yb prefix- incomparable two-by-two (for 
x and ya, use Lemma EH and similarly for x and yb). Now use Lemma 15.91 to construct a 
maximal prefix code Q U {x, ya, yb} U P', with Q C PA*. Define foo £ Gxi by 

/io(2/a) = y6, h (2/6) = Z/a, hoOO = s > and 
fa is the identity on QU P'. 

So, Q U {x, ya, y6} U P' is the domain code and image code of ho- Then ho £ pFix Gn ^P'A*), 
h (ya) 7^ ya, and h(xoa) = xa (since ho(x) = x). So here, a plays the role of uq in the separation 
property. 

(2) Note that h preserves lengths, so h £ G™° d3 , which proves that pFix G mod3(PM*) is 
separating too. □ 

Corollary 9.15 (Commutation test). 

Let A be an alphabet with \A\ = n > 2. Let G = G n> \ or G 
complementary prefix codes over A. Then for any g £ G we have: 
g £ pFix G (PA*) iff gh = hg for all h e pFix G (P' A*) . 



= G™° d3 ; and let P,P' be 
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9.3 Commutation test, finite presentation, and word problem of 

Gff 3 (o, i) 

We prove that the commutation test works for G™° d3 (0, 1), thus reducing the circuit equivalence 
problem to the word problem of G™° d3 (0, 1) (over an infinite generating set). 
Then we show that G!f° d3 (0, 1) is finitely presented. 

Finally, we embed G™° d3 (0, 1) into a finitely presented Thompson group iP^O, 1); thus show- 
ing that P(0, 1) is a finitely presented group with coNP-hard word problem. 

So, G™° d3 (0, 1) and the corresponding group H (0, 1) have similar properties as G™° d3 (0, 1; #) 
and H(0, The proofs are similar too, but a little more complicated in the case of 
G™ d3 (0, 1). The other sections of this paper do not depend on this subsection. 

Definition 9.16 Let G C G™ d3 {0, 1). Let P, P' be complementary prefix codes over {0, 1, 
with P n {0,1}* ^ 0, P' n {0,1}* ^ 0. The fixator pFix G (P'{0, 1, #}*) is separating on 
P{0, 1, #}* iff the following hold for any ordered pair (x, y) of prefix-incomparable words x, y G 
{0,1}* U {0,1}*#{0,1,#}*: 

• If x,y G (PC {0, 1}*) {0, 1}* ; then there exists h G pFix G (P'{0, 1, #}*) and there exists 
u G {0, 1}* such that 

h(xu) = xu and h(yu) ^ yu. 

• If x,y {0, 1}*, and x,y G P{0, 1, #}* there exists h G pFix G (P'{0, 1, #}*) such that 

h(x) = x and h(y) ^ y. 

(Note that we don't have any requirements in the case where x G {0, 1}* and y ^ {0, 1}*, or 
the case where x {0, 1}* and y G {0, 1}*.) 

Lemma 9.17 7/pFix G (P'{0, 1, #}*) is separating on P{0, 1, #}* then it is a maximal fixator. 

Proof. Recall the definition of a maximal fixator from Section 5. Suppose by contradiction 
that there exists x G P{0, 1,#}* such that h(x Q ) = x for all h G pFix G (P'{0, 1, #}*). By 
Lemma the prefix code P is of the form P = Pi U Uueft v $ P{ v )-> where Pi = Pfl {0, 1}*. 
Case 1: x G Pi{0,l}*. 

Choose x = xoO and y = xqI. Then x and y are prefix incomparable, hence by the separation 
property of the fixator, there exists h G pFix G (P'{0, 1, #}*) and u G {0, 1}* with 

h (xu ) = xu , h (yu ) ^ yu . 

However, h (yuo) ^ yu contradicts the fact that h (xo) = x . 

Case 2: x G P{0, 1}*#{0, 1, #}*, or x G \J vin p 2 v # P ( v ) with |P 2 | > 2. 

Let Xq = t>o#s. Let w G P 2 with w ^ v , and choose x = w #t (for some t G P(w )) and 
y = v #s. Then x and y are prefix incomparable, and both are in {0, 1}*#{0, 1, #}*; so there 
exists h G pFix G (P'{0, 1, #}*) with 

h (x) = x, h (y) ^ y. 

However, ho(y) ^ y contradicts the fact that /io(^o) = ^o- 

Case 3: xq G [j vin p 2 v^P(v) and | -F2 1 = 1- (Obviously the case | P2 1 = cannot occur when 
xo G P 2 #.) 
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Then P 2 = {vq}, so we have x = Vq^s G t>o#P(t>o), for some s G P(vq). Let Zo G Pi (recall 
that we assume P x ^ 0). Let x = 2 # an d V = x o = v oi^ s - Since z 7^ u 0; # an d 2/ are prefix 
incomparable, and both are in {0, 1}*#{0, 1, #}*; so there exists ho G pFix G (P'{0, 1, #}*) with 

/io(aj) = z, ho(y) 7^ 2/- 
Again, /io(?/) 7^ 2/ contradicts the fact that ho(xo) = xq. □ 

Proposition 9.18 Let P, P' be complementary prefix codes over {0, 1, with Pfl{0, 1}* and 
P'n {0, 1}* non-empty. Suppose that G C £™ d3 (0, 1) is a group, and that pFix G (P'{0, 1, #}*) 
is separating on P{0, 1, #}*. Then for all g £ G we have: 

If g commutes with all elements o/pFix G (P'{0, 1, #}*) then g G pFix G (P{0, 1, #}*). 

Proof. Let jgG and assume g commutes with all elements of pFix G (P'{0, 1, #}*)■ We want 
to show that g G pFix G (P{0, 1, #}*). We first prove: 

Claim: g stabilizes P'{0, 1, #}* and P{0, 1, #}*. 

Proof of the Claim: Assume by contradiction that g(x') = y for some x' G P'{0, 1, #}* 
and y G P{0, 1,#}*. Since g commutes with all elements of the fixator we have for all h G 
pFix G (P'{0, 1, #}*): gh(x') = hg(x') = g(x') = y, i.e., h(y) = y. This contradicts the 
maximality of the fixator pFix G (P'{0, 1, #}*); so g maps P'{0, 1, #}* into itself. 

Similarly, g^ 1 maps P'{0, 1, ^}* into itself. From this it follows (as in the proof of Propo- 
sition |mH) that g also maps P{0, 1,#}* into itself, and similarly for y -1 . This proves the 
Claim. 

Assume now by contradiction that g does not fix some element X\ G P{0, 1, #}*: #(xi) = yi 7^ 
x x . By the Claim, y x G P{0, 1, #}*. 

By Lemma f5. 81 there exist x, y G P{0, 1, #}* such that x and ?/ are prefix incomparable and 
g{x) = y. And since g commutes with the fixator, we have for all h G pFix G (P'{0, 1, #}*): 
g h(x) = hg(x) = h{y). 

On the other hand, the separation property of the fixator implies that there exists ho G 
pFix G (P'{0, 1, #}*) and u G {0, 1, #}* such that ho(yuo) ^ yuo and ho(xuo) = xuq. 

The equality gh{x) = h(y) implies gho(xuo) = h(yuo); this, together with ho(xuo) = xuo, 
implies yuo = gho(xuo) = h(yu ). But this contradicts ho(yuo) ^ yuo- □ 

In the next two Lemmas we will check that the group G = Cj3 n ° d3 (0, 1) satisfies the conditions 
of Proposition 19.181 i.e., that the fixator pFix G (P'{0, 1, #}*) is separating. 

Proposition 9.19 Let P,P' be complementary prefix codes over {0,1, jf\ with P fl {0,1}* 
and P' n {0, 1}* non-empty. Let G = G^ d3 (0, 1). Then the fixator pFix G (P'{0, 1, #}*) is 
separating on P{0, 1, #}*. 

Proof. Let x,y G Pi{0, 1}* or x,y G Pi{0, 1}*# U [J ve p 2 v # P(v), and assume x and y are 
prefix incomparable. We want to find h G pFix G (P'{0, 1, #}*) and Uq G {0,1}* such that 
ho(xu ) = xu and h (yu ) ^ yu , or h (xu ) ^ xu and h (yuo) = yu ; if x, y ^ {0, 1}* then 
Uq is empty. 

Case 1: x,y e Pi{0,l}*. 

In this case we can apply the same proof as for Proposition 19. 14( with alphabet A = {0, 1}. 
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Case 2: x,y G Pi{0, 1}*#{0, 1,#}* U \J veP2 v#P(v) {0,1,#}*. 

Let x = x #s and y = y #£o 
Case 2.1: y e Pi{0,l}*. 

• Assume £o G P 2 . 

Now, x 7^ yo, since x G P 2 and yo £ Pl{0, 1}*; since P 2 is closed under prefix (by Lemma 
19. 2|) . ?/ is not a prefix of xo- By Lemma 15.91 over the alphabet A = {0, 1}, there is a finite 
prefix code Qi C Pi{0, 1}* such that Q\ U {yoOO} and P[ and complementary prefix codes (over 
{0, 1}). Therefore the following will be a finite maximal prefix code over {0, 1, #}: 

C = Q 1 U{2/ 00}UP[U \J veQ2 v#H(v), 
where Q 2 = > prc f (Qi U {y 00} U Pf); moreover, 11(f) = P(f) if f G P 2 , 11(f) = P'(f) if 
v G P 2 ', and 11(f) consists of just the empty word e otherwise. 
Now we define ho, with domain code and image code C, by 

h (yo#t ) = w 0#, h o (y o 0#) = y #t , and 

ho is the identity everywhere else on C. 
Thus, h (y) ^ y. Moreover, h G pFix G (P'{0, 1, #}*), because y 0# £ v # p '( v ); indeed, 

y 0#GP 1 {0,l}*# CP{0,1,#}*. 

And /io preserves the length of strings in {0, 1}* (since ho is the identity on {0, 1}* wherever 
h is defined). 

We also claim that h (x) = x. Indeed, xo belongs to P 2 , which is contained in > pre f(Pi UP{); 
moreover, > pre f(Pi) C > pro f(<3i), by the 3rd point of Lemma l5~H| and s G P'(xo). Therefore, 
x o# s o belongs to C . On the other hand, x is different from y and y 0. 

• Assume x G Pi{0, 1}*. 

Then, by Lemma 15.111 there are £±,£2 G {0,1} such that Xo£i and yo£ 2 are prefix incom- 
parable. By applying Lemma 15.91 over the alphabet A = {0, 1} we obtain a finite prefix code 
Qi C Pi{0, 1}* such that Qi U {xo£i,yo£ 2 0} and P( and complementary prefix codes (over 
{0, 1}). Therefore the following set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code 
over {0,1, #}: 

C = Q 1 U{x £ u y £ 2 0}UP{U LU 0a v # n(f), 
where Q 2 = > pre f(Qi U {x £i, yo^O} U P/), and where 11(f) = P(f) if f G P 2 , 11(f) = P'{v) 
if f G P 2 , and 11(f) consists of just the empty word otherwise. 

Now we define ho, with domain code and image code C, by 

ho(yo#t ) = y 4#, h (y £ 2 #) = yo#^o, and 

/2.0 is the identity everywhere else on C. 
Thus, h (y) ^ y. Moreover, h G pFix G (P'{0, 1, #}*), because y 4# ^ U oe p' U #P'( U ); indeed, 
2/o^2# GP X {0,1}*# CP{0,1,#}*. 

And ho preserves the length of strings in {0, 1}* (since ho is the identity on {0, 1}* wherever 
it is defined). Also, h (x) = x, since x #s belongs to C (since x is a strict prefix of x £i), 
and since x is different from y and yo^2- 

Case 2.2: y e P 2 . 

Since Pi 7^ 0, there exists Wq G P\\ hence yo is different from wo, woO, and tfoOO. Also, 
x is different from fj or from fj 00 (or from both). Let z be one of w or u> 00, so that 
z ^ x . We still have z ^ y and z G Pi{0, 1}*. 
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• Assume io 6 ft- 

By Lemma 1531 over the alphabet A = {0, 1}, there is a finite prefix code Q% C Pi{0, 1}* such 
that Qi U {^oO} and P[ and complementary prefix codes (over {0, 1}). Therefore the following 
set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code over {0, 1, #}: 

C = Q 1 U{z 0}UP{U U* e0a «#n(t;), 

where Q 2 = > prc f(Qi U {^ 0} U P[), and where U(v) = P(v) if v G P 2 , U{v) = P'{v) if v G P^, 
and n(f ) consists of just the empty word otherwise. 

Now we define ho, with domain code and image code C, by 

h (yo#t ) = z #, M^o#) = 2/o#*o, and 
/io is the identity everywhere else on C . 

Thus, h (y) ^ y and h (x) = x. Note that /io(^o# s o) an d ^o(2/o#^o) are defined since Xo>2/o G 
P2 C > pre f (-Pl U P{); moreover, > pre f(Pi) C > pre f(Qi), by the 3rd point of Lemma l5~9l 
Therefore, xo#so and yo#t belong to C. 

Also, h G pFix G (P'{0, 1, #}*), because z # <£ \J veP , v#P'{v); indeed, z # G P{0, 
C P{0,1,#}*. 

Also, /iq preserves the length of strings in {0, 1}* since ho is the identity on {0, 1}* wherever 
it is defined. 

• Assume Xo G Pi{0, 1}*. 

Then, by Lemma 15.111 there are £i,£ 2 G {0,1} such that xq£i and 20^2 are prefix incom- 
parable. By applying Lemma 15.91 over the alphabet A = {0, 1} we obtain a finite prefix code 
Qi C Pi{0, 1}* such that QiUjxo^i, zo£ 2 } and P[ and complementary prefix codes (over {0, 1}). 
Therefore the following set C C {0, 1}* U {0, 1}*# will be a finite maximal prefix code over 
{0,1, #}: 

C = Q 1 U{x £ 1 ,z e 2 }UP{U [J veQ2 v#Il(v), 

where Q 2 = > pre f(Qi U {x £i, z £ 2 } U P[), and where U(v) = P(v) if v G P 2 , II (v) = P'(f) 
if i> G P 2 , and 11(f) consists of just the empty word otherwise. 

Now we define ho, with domain code and image code C, by 

ho(yo#t ) = z #, h (z #) = y #t , and 
ho is the identity everywhere else on C . 

Thus, h (y) ^ y, and y G C (since y G P 2 C > prcf (Pi U P/) C > prcf (Qi U P[) ). 

Moreover, h G pFix G (P'{0, 1, #}*), because z # ^ P'{0, 1, #}* for the same reasons as in 
the previous subcase. 

And ho preserves the length of strings in {0, 1}* (since h is the identity on {0, 1}* wherever 
it is defined). Also, h (x) = x, since x #s belongs to C (since x is a strict prefix of x £i), 
and since x ^ y and xo ^ zq. □ 

Propositions 19.181 and 19.191 immediately imply: 

Corollary 9.20 (Commutation test for G^ d3 (0, 1)). 
Let G = Gff 3 (0, 1). For any g G G we have: ' 

g GFbc G (0{0, !,#}*) iff gh = hg for all h G pFix G ({l, #}{0, 1, if}*)- 



59 



Now, by the same reasoning as in Section 5, the above Corollary reduces the circuit equiv- 
alence problem to the word problem of G™i d3 (0, 1); the reduction is an unbounded con- 
junctive reduction. The next Lemma implies that pFix G ({l, #}{0, 1, #}*) is isomorphic to 
G = G^ d3 (0, 1). This and the fact (proved later in this subsection) that G^° d3 (0, 1) is finitely 
generated implies that only the finitely many generators of pFix G ({l, #}{0, 1, #}*) need to be 
used in the role of "/i" in the above Corollary. This then yields: 

Corollary 9.21 The circuit equivalence problem reduces to the word problem of G™° d3 (0, 1) 
(over an infinite generating set), by a polynomial-time k-bounded conjunctive reduction. Here, 
k is the minimum number of generators o/G™° d3 (0, 1). 

Lemma 9.22 For G = G^ d3 (0, 1), the subgroup pFix G ({l, #}{0, 1, #}*) is isomorphic to G. 

Proof. An element up G G = G™ d3 (0, 1) = pStab Gfl o d3 ({0, 1}*) belongs to Fix G ({l, #}{0, 1, #}*) 
iff if has a table of the form 

T 1 # Oxi ... 0x n 0xi#si . . . Ox' m #s m 
V ~ [ 1 # Oyi ■ ■ ■ 0y n Oy'^h . . . Oy' m #t m 

where Xi,yi,x'j,yj range over {0,1}*, \xi\ = \yi\ mod 3 (i — 1, . . . , n), and Sj,tj G {0, 1,#}*. 
The isomorphism to G, as above, just maps this table to 

, X\ . . . X n X^^S\ . . . X m ^ z S m 

[yi ■■■ Vn y[#h . . . y' m #t m _ 

It is straightforward to see that this is an isomorphism. □ 

We will prove next that the group G < 3 1 ° d3 (0, 1) is finitely presented. As in Section 6, we 
will follow Higman's method and, accordingly, we will have to prove appropriate facts about 
maximal prefix codes over {0,1, #}. We will use the following notation (as before): For any 
maximal prefix codes P, Q C {0, 1, #}*, we let Q 1 = Q n {0, 1}* and P 1 = P n {0, 1}*; these 
are maximal prefix codes over {0, 1}. 

Lemma 9.23 Let Q be a finite maximal prefix code over {0,1, #}. Suppose the inner tree 
Tm(Q) has a leaf £ G {0, Assume that there exist elements q\,q2 G Q such that, q±,q2 
are not children of £ (in the prefix tree ofQ), and such that either both qi,q2 ^ {0, 1}*, or both 
<?i?<?2 £ {0, 1}*; in the latter case we also assume that \qi\ = \q 2 \ mod 3. 

Then there exists a finite maximal prefix code P over {0, 1, such that \P\ = \Q\, Pi has 
the same mod 3 cardinality as Q 1} and the inner tree T in (P) has two leaves £ 1; £ 2 G {0, 1, #}* 
such that: 

(1) If both q 1 ,q 2 G {0,1}* then £ 2 G {0,1}*, with \£ 2 \ + 1 = |gi| = \q 2 \ mod 3. Moreover, if 
£ i {0, 1}* then £ t <£ {0, 1}*; if £ £ {0, 1}* then l x G {0, 1}*, and = \£\ mod 3. 

(2) Ifboihq u q 2 i {0,1}*, andif qi or q 2 <£ {0, 1}*# ; then £ 2 <£ {0,1}*. Moreover, if £ £ {0,1}* 
then £ x £ {0, 1}*; if £ G {0, 1}* then £ x G {0, 1}*, and \£ x \ = \£\ mod 3. 

(3) Ifbothq u q 2 G {0, !}*#, and if £ ^ {0,1}*, then £ 2 <£ {0,1}*. 



60 



Note that the case where q±, q 2 G {0, 1}*# and £ G {0, 1}* is not considered in the Lemma (and 
will not be needed). 

Proof. Obviously, T- m (Q) has at least one leaf. 

(1) If qi, q 2 G {0, 1}* we do the following transformation on Q, where Z\ and z 2 are the parent 
vertices of q±, respectively q 2 , and where ST1,ST2, ST3, ST4 are subtrees (below z\ or z 2 ) of 
the prefix tree of Q. If Z\ and z 2 are on a common path from the root we let Z\ be the deeper 
one of the two. 

I \ becomes I \ 

zl z2 zl z2 

/ I \# / I \# / I \# / I \# 

ST1 ST2 ST3 ST4 ST1 ST3 ST4 

I 

ST2 



/ / 
z2 z2 
/ I \# / I \# 

ST4 . ST1 ST4 

becomes . I 

ST2 

/ / 
zl zl 
/ I \# / I \# 

ST1 ST2 

Let P be the prefix code described by the transformed tree. Then \Q\ = \P\ (since the 
number of vertices has not changed), and P\ has the same mod 3 cardinality as Q\ (since the 
subtree ST1 is moved from z\ to z 2 and z\, z 2 have equivalent depths mod 3). The subtree ST2 
was under # and is still below a #-edge. Finally, Ti n (P) has two leaves, namely I and z\ (both 
G {0, 1}*), and \z\\ + 1 = |gi| (actually the two numbers are equal). The existing leaf I is either 
unchanged, or (in case it was in ST1) changed to a leaf that has an equivalent depth modulo 
3, or (in case it was in ST2, hence was ^ {0, 1}*) changed to a leaf ^ {0, 1}*. 
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(2) If qi ,q 2 <£ {0, 1}*, and q 1 {0, 1}*# (or, similarly, if q 2 <£ {0, 1}*#), we do the following 
transformation on Q. As above, z\ and z 2 are the parent vertices of q±, respectively q 2 , and 
ST1, ST2, ST3, ST4, ST5 are subtrees (below z\ or z 2 ) of the prefix tree of Q; one of ST3 or 
ST5 is a single vertex (corresponding to the word q 2 ). If Z\ and z 2 are on a common path from 
the root we let Z\ be the deeper one of the two. Since q\ {0, 1}*# and qi £ {0, 1}*, we have 
Zi {0, 1}*; i.e., # appears on the path root-to-^. 

I# \ l# \ 

I \ becomes I \ 

zl z2 zl z2 

/ I \ / I \# / I \# / I \# 

ST1 ST2 ST3 ST4 ST5 ST3 ST4 ST5 

I 

ST1 
I 

ST2 



/ / 
z2 z2 
/ I \# / I \# 

ST4 . ST4 

#. becomes #. I 

ST2 

/ / I 

zl zl ST1 

/ I \ / I \ 

ST1 ST2 



Then the transformed tree describes the desired prefix code P. In particular, Pi = Qi, since 
all the changes happen below ^-edges. The existing leaf £ is either unchanged, or (in case it 
was in ST1 or ST2, in which case £ {0, 1}*) changed to a leaf that is below a #-edge. 

(3) Suppose qi,q 2 e {0, 1}*# and £ £ {0, 1}*. 

• If T m (Q) has any subtrees other than the path root-to-£, then T- m (Q) has another leaf besides 
t. In this case we have nothing to prove. 

• If T in (Q) consists only of the path root-to-£, let p be the parent vertex of £ in T in (Q). Since 
qi,q 2 € {0, 1}*#, the path root-to-£ of T- m (Q) has some edges labeled over {0, 1}, and at least 
one Zi or z 2 (the parent vertices of qi and q 2 in the prefix tree of Q) is at least 2 depth levels 
above £; assume Zi is the deeper one. 
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We do the following transformation on Q. 



(#)/ 



/ 
z2 
/l\# 
q2 



(#)/ 



/ 
z2 
/l\# 
q2 
/l\ 



/ / 

P P 
(#)/|\ becomes (#)/l\ 

1 1 
/l\ 



In other words, the three children of £ are moved to q2- Now \P\ = \Q\ (since no additional 
vertices are added), and Pi — Qi, hence Pi and Qi have the same mod 3 cardinality. Also, 
T in (P) has two leaves, namely q2 £ {0, 1}*, and p. □ 

Again, for maximal prefix codes P, Q over {0, 1, #} we will use the notation Pi — P n {0, 1}*, 

Q 1 = Qn{o,i}*. 

Lemma 9.24 Let Q be a finite maximal prefix code over {0,1, Suppose the inner tree 
Ti n (Q) has two leaves £i,£2 G {0, Assume that there exist elements qi,q2 G <5 sitc/i 
that, qi,q2 are not children of i\ or £ 2 (in the prefix tree of Q), and such that either both 
Qiy <?2 ^ {0, 1}*, or both qi, q 2 G {0, 1}*; in the latter case we also assume that \qi\ = \q2\ mod 3. 

Then there exists a finite maximal prefix code P over {0, 1, #} such that \P\ = \Q\, Pi has the 
same mod 3 cardinality as Qi, and the inner tree T- m (P) has three leaves \\, A2, A3 G {0, 1, #}* 
such that: 

(1) If both qi,q 2 G {0, 1}* then A 3 G {0, 1}*, with |A 3 | + 1 = \qi\ = \q 2 \ mod 3. Moreover (for 
alii = 1,2), both ^ {0,1}*, or both \,ti G {0,1}*, and in the latter case \\\ = \£i\ mod 
3. 

(2) If both q x ,q 2 {0,1}*, and if q\ or q 2 {0, 1}*#, then A 3 ^ {0,1}*. Moreover (for all 
i = 1,2), both Xi,£i (fi {0,1}*, or both G {0,1}*, and in the latter case |Aj| = \£i\ mod 3. 

(3) If both q u q 2 G {0, 1}*#, and if £1 £ {0, 1}*, then A 2 = £ 2 and A 3 ^ {0, 1}*. (However, Ai 
could be G {0,1}* or <£ {0,1}*.; 

Proof. The proof is similar to the proof of Lemma f9. 231 

(1) If qi, q 2 G {0, 1}* we do the following transformation on Q, where z\ and Z2 are the parent 
vertices of qi, respectively q2, and where ST1, ST2, ST3, ST4 are subtrees (below zi or z 2 ) of 
the prefix tree of Q. If z\ and Z2 are on a common path from the root we let Zi be the deeper 
one of the two. 
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I \ becomes I \ 

zl z2 zl z2 

/ I \# / I \# / I \# / I \# 

ST1 ST2 ST3 ST4 ST1 ST3 ST4 

I 

ST2 



/ / 
z2 z2 
/ I \# / I \# 

ST4 . ST1 ST4 

becomes . I 

ST2 

/ / 
zl zl 
/ I \# / I \# 

ST1 ST2 

Let P be the prefix code described by the transformed tree. Then \Q\ = \P\ (since the number 
of vertices has not changed), and Pi has the same mod 3 cardinality as Q\ (since the subtree 
ST1 is moved from z\ to Z2 and z\, Z2 have equivalent depths mod 3). The subtree ST2 was 
under # and is still below a #-edge. Finally, T in (P) has three leaves, namely £1, £2, and z\ (all 
G {0, 1}*), and \z\\ + 1 = (actually the two numbers are equal). An existing leaf is 
either unchanged, or (in case of a leaf in ST1) is changed to a leaf that has an equivalent depth 
modulo 3, or (in case of a leaf in ST2, hence ^ {0, 1}*) is changed to a leaf ^ {0, 1}*. 

(2) If qi,q 2 4- {0> 1}* an d if <?i 4- {0> !}*# ( or > similarly, if q 2 {0, 1}*#), we do the following 
transformation on Q. As above, Z\ and z 2 are the parent vertices of qi, respectively q 2 , and 
ST1, ST2, ST3, ST4, ST5 are subtrees (below z\ or z 2 ) of the prefix tree of Q; one of ST3 and 
ST5 is a single vertex (corresponding to the word q 2 ). Again, if z\ and z 2 are on a common 
path from the root we let z\ be the deeper one of the two. 

I# \ I \ 

I \ becomes I \ 

zl z2 zl z2 

/ I \ / I \# / I \# / I \# 

ST1 ST2 ST3 ST4 ST5 ST3 ST4 ST5 

I 

ST1 
I 

ST2 
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/ / 

z2 z2 
/ I \# / I \# 

ST4 . ST4 

#. becomes #. I 

ST2 

/ / I 

zl zl ST1 

/ I \ / I \ 

ST1 ST2 



Then the transformed tree describes a maximal prefix code P with the desired properties. In 
particular, P± = Qi, and z\ ^ {0, 1}* is now a leaf. 

(3) Suppose gi, q 2 G {0, 1}*# and t x £ {0, 1}*. 

• If T m (Q) has other subtrees besides the paths root-to-^i and root-to-^2, then T m (Q) has 
another leaf besides t\ and £ 2 . In this case we have nothing to prove. 

• If T in (Q) consists only of the paths root-to-£i and root-to-£ 2 , let pi be the parent vertex of 
i\ in T in (Q). Since l\ ^ {0, 1}*, the path root-to-^i of T in (Q) has some edge(s) labeled by #. 

Since 91, q 2 G {0, 1}*#, the paths root-to-^i or root-to-£2 of T- m (Q) have some edges labeled 
over {0, 1}. If Zi and z 2 (the parent vertices of qi and q 2 in the prefix tree of Q) are on a 
common root-to-leaf path, let z\ be the name of the deeper one of the two; then z 2 is at least 
2 depth levels above l\ or i 2 . If z\ and z 2 are on different paths root-to- leaf, let z 2 be on the 
path root-to-^; then z 2 will be at least one depth level above £ 2 (since qi,q 2 are not children 
of zi, z 2 ). We do the following transformation on Q. 

I I 
z2 z2 
/ I \# / I \# 

q2 . q2 

(#). (#). /I\ 

/ / 
pi pi 
(#)/|\ becomes (#)/l\ 

11 11 
/l\ 
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\ . \ 

(#) . z2 (#) . z2 

/l\# • /I\# 

/ . q2 / . q2 

pi pi /l\ 

(#)/|\ . becomes (#)/l\ 

11 / 11 / 

/ I \ 12 12 

/l\ /l\ 



In other words, the children of £ x are moved to q 2 . Now |P| = \Q\ (since no new vertices 
are created), and P x = Q x , hence Pi and Q x have the same mod 3 cardinality. Also, T in (P) 
has three leaves, namely q 2 {0, 1}*, £ 2 , and p x . If £ x E {0, 1}*# then p x E {0, 1}*, and if 
£ x £ {0, 1}*# (but still £- 1 £ {0, 1}*) then p x £ {0, 1}*. □ 

Lemma 9.25 The group G™° d3 (0, 1) is generated by its elements of table-size < c gen; for some 
constant c, 



gen • 



Proof. We follow the same method as in the proof of Lemma f6 .41 Let 

E G 3j i(0,l). 



xO xl £4 
2/1 2/2 V3 2/4 



Vn 



The image code's inner tree, T in (imC(<^)), has a leaf y, so {y x , . . . , y n } also contains 3 words of 
the form y ix = yO, y i2 = yl, y iz = y#, where y E {0, 1, #}*. The three indices i x ,i2,h are in 
{1, . . . , n}, but any order relation between i x , i 2 , is is possible. 

Case 1: The column index sets {1,2,3} and {i x , 12,^3} are disjoint. 
Then, after permuting columns (if necessary), the table of (p has the form 

xO xl X4 X5 Xq X-j ... x n 

Vi V2 2/3 2/0 yl y# y 7 ... y n 



Case 1.1: y E {0, 1}*. (The case where, instead, x E {0, 1}* is very similar.) 

Then £4, £5 E {0, 1}*, and \x4\ = \x$\ = \y\ + l mod 3, since (p E G™° d3 (0, 1). Then, applying 
Lemma \9. 231 (1) to the prefix code Q = domC(<^), we obtain a maximal prefix code P with the 
properties listed in that Lemma. In particular, T in (P) has two leaves, £ x E {0, 1}*, \£ x \ = \x\ 
mod 3, and £ 2 E {0, 1}* with \£ 2 \ + 1 = |x 4 | = |x 5 | = |y| + 1 mod 3. These properties imply 
that P can be inserted into the table of ip as an intermediary row, and that the columns can 
be lined up in such a way that (p is factored as two elements of G™° d3 (0, 1): 

xO xl xj^ 

£ x £ x l £ x # £ 2 £ 2 1 £ 2 # z 7 ... z n 
yi 2/2 2/3 2/0 yl 2/# 2/7 • • • 2/«. . 

Now, as in the proof of Lemma T6.4[ the two factors can be extended, so as to get smaller tables. 
Case 1.2: Both x,y £ {0, 1}*. Then x 4 , x 5 , y x , y 2 $ {0, 1}*. 
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Case 1.2.1: If x 4 or x 5 {0, 1}*#, then we apply Lemma |9~2*3I (2) to the prefix code Q = 
domC(<^). If yi or y 2 £ {0, 1}*#, then we apply Lemma ID. 231 (2) to the prefix code Q = imC(y?). 
Next, we insert P into the table of <p in the same way as in case 1.1. 

Case 1.2.2: If x^,x 5 ,yi,y 2 £ {0, 1}*#, we can again apply Lemma 19. 231 (3) to Q — domC(<^). 
If P has both £1,^2 ^ {0, 1}*, we insert P as a row, as in case 1.1. 

However, if £ 2 ^ {0, 1}* and h £ {0, 1}*, we cannot proceed as before, because both 
x, y £ {0, 1}*; the resulting factors of would not stabilize {0, 1}*. So this time we insert P as 
two rows into the table of ip (possibly after permuting columns), as follows: 

X0 Xl £4 X 5 Xq 

£i0 

h# ilO 

2/# Vn-2 



J n—\ 



Vl 2/2 2/3 



£ 2 
yO 



41 
2/1 



^il 

Vn-l 



X n 
2/n 



The columns can be lined up in such a way that the three factors of ip belong to G™° d3 (0, 1). 
Indeed, |P| = \Q\, and Pi has the same mod 3 cardinality as Q\. Also, £4, £5, j/i, 2/2, x,y,£ 2 
^{0,1}*. 

Case 2: The column index sets {1,2,3} and {11,12,^3} overlap. 

Case 2.1: Suppose {1,2} overlaps with {h, 12,13} and {h,i 2 } overlaps with {1,2,3}. 

Then both x, y £ {0, 1}* or both x, y ^ {0, 1}*. By Lemma 19.231 we find a prefix code P, 
with £ 2 £ {0, 1}* if x,y £ {0, 1}*, and £ 2 $ {0, 1}* if a;,y g {0, 1}*. Then we insert P as two 
rows: 



xO 
£ 2 



x 1 



4# 



X4 

4«2 
2/^2 



X 5 
2/^3 



^n-2 
^0 
^0 

2/n-2 



•"n— 1 
hi 
hi 

Vn-l 



X n 
*1# 

Vn 



Case 2.2: Suppose {1, 2} n {h, i 2 , 13} = or i 2 } n {1, 2, 3} = 0. 

Then {1,2,3} fl {11,12,13} = {3} or {1,2,3} H {^1,22,23} = {^3}- We only consider the 
case where the intersection is {3} (the case when it is is very similar). Then the table of 
p is 

X0 Xl X4 £5 

2/i 2/2 yai ya 2 ya 3 

Case 2.2.1: Ux,ye {0,1}*, then yai i {0,1}* (since 93 £ G^ d3 (0, 1)); hence a x = #. Now 
we proceed as in case 1.1. 

Case 2.2.2: If x, y {0, 1}*, then y±,y 2 ,X4,x 5 £ {0, 1}*. Now we proceed as in case 1.1. 
Case 2.2.3: If x £ {0, 1}* and y g {0, 1}*, then y l5 y 2 £ {0, 1}*. 

We apply Lemma f9. 241 (1) to the maximal prefix code Q = imC(<y9) with existing leaf I = 
y {0, 1}*, and with q x , q 2 equal to yi, y 2 £ {0, 1}* respectively. Then we obtain a code P with 
h £ {0, 1}*, and with £ 2 £ {0, 1}*, \£ 2 \ + 1 = \yi\ = \y 2 \ mod 3. Now we insert P into the table 
of (p as two rows, to obtain (after permuting columns, if necessary): 

xO xl x# x 4 x 5 x r 



Xn—4: X n — 3 3^n— 2 X n —\ 



,0 



2/i 2/2 



4# 

£\d\ £\a 2 ha 3 
ya x ya 2 ya 3 



?i0 



1 



# 



2/„_5 y n _ 4 2/n-3 



^2# 
l/n-2 



hi 

Vn-l 



X n 

ho 

Vn 
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Here we assume that the columns n — 3 and n — 2 (that contain 4#, respectively 4#) are 
disjoint. This assumption can always be made if n is large enough so that {xi, . . . ,x n } and 
{yi, . . . , y n } contain enough elements G {0, 1}* and ^ {0, 1}*. Then we can insert another copy 
of P as follows: 



xO 


xl 






x 5 


■ ■ ■ -En— 5 ^n—A -^n— 3 


*^n— 2 -En—l 




40 


41 


4# 






... 40 41 4# 
... 40 41 4# 


4# 41 


40 






l\d\ 


t\a 2 


4a 3 . . . 




4# 41 


40 


yi 


2/2 


yai 


ya-2 


ya 3 ... 


y-n-5 yn-A 2/n-3 


y n _ 2 y n -i 





This gives us a factorization of as four elements of Gf° d3 (0, 1), each of which can be reduced. 

Case 2.2.4: The case where y G {0, 1}* and x £ {0, 1}* is similar to case 2.2.3, now using 
Q = domC(^). □ 

In analogy with Lemma f6. 51 Lemma f9. 251 can be strengthened as follows. 

Lemma 9.26 Every element tp G Gf° d3 (0, 1) of table- size > c gen can be represented by a word 
over the set of elements of table-size < c gen , and such that the sequence has table-size 
< \\ip\\. The constant Lemma 1.9. 25\ 

Proof. This follows from the proof of Lemma f9. 251 In that proof, we started out with a table 
of ip (of table-size ||<£>||), and repeatedly inserted rows. No columns are ever added, hence the 
table-size doesn't increase. See also the proof of Higman's Lemma 4.3 in [Tl]. □ 

Lemma 9.27 The group G™° d3 (0, 1) is presented by relators of table- size < c Te \, in terms of 
generators of table-size < c gen , where c gen is the constant from Lemma V9.2fy and c re i is another 
constant. Hence, G™° d3 (0, 1) is finitely presented. 

Proof. We use the same approach as in Proposition 16.61 fbased on Higman's proof that G^^. 
is finitely presented (see [H], pp. 29-33). We now use Lemma 19.241 

For the same reason as in Lemma [9.251 the new rows that are inserted have their columns 
lined up in such a way that all pairs of adjacent rows represent elements of G™° d3 (0, 1) (and 
not just of (j3,i). 

Higman's "type II" reductions (described in the figure in the top of p. 31 of [Hj) can actually 
be replaced by his "type III reductions" (described in the figure in the top of p. 32 of [T4*j). 
Type II reductions are never needed (the reason why they were used by Higman is probably 
that they are more efficient: they require a single row insertion; on the other hand, a type III 
reduction consists of two transformations). 

Type III reductions require that we insert a row corresponding to a prefix code with 3 leaves 
in the inner tree (see the figure at the top of p. 31 in [Tij). Since one of the pre-existing rows 
in the table already has two leaves (b and c in Higman's notation), we want the table size to 
be large enough so that the maximal prefix code Q = {b0, 61, 6#, cO, cl, c#, . . .} (2nd row of 
figure at bottom of p. 31, and 2nd row of figure at top of p. 32 in ^3]) contains either another 
leaf in its inner tree or two words that are not children of a leaf of the inner tree. In the latter 
case we apply Lemma f9.24l and obtain a maximal prefix code P with three leaves x,y,z with 
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x equivalent to b and y equivalent to c. (Here we define two words u, v G {0, 1, #}* to be 
equivalent iff both u,v £ {0, 1}* or both u,v & {0, 1}* and |w| = |t> | mod 3.) Also, P and Q 
have the same cardinality, and Pi and Q\ have the same mod 3 cardinality. Therefore, we can 
insert a row corresponding to the prefix code P in exactly the same way as on p. 32 of [14] . 
taking care to line up the columns so that the factors belong to G™° d3 (0, 1). □ 

Just as for G™° d3 (0, 1; one can prove that G™° (0,1) is not a simple group; a very 
similar homomorphic image can be taken. We can summarize the results for G™° d3 (0, 1) as 
follows. 

Theorem 9.28 The group G™° d3 (0, 1) is finitely presented, and not simple. 

The word problem of G™° d3 (0, 1), over the generating set Aq,i U {t^+i : < i} is coNP- 
hard, with respect to constant- arity conjunctive polynomial-time reduction. Here Ao,i is a finite 
generating set o/G™° d3 (0, 1). 

As a consequence of Proposition 17. II we can consider the following HNN-extension: 

#(0,1) = (G-° d3 (0, 1) U {t} : {ifff^j^^GGf^l)}). 

Since G™° d3 ({0, 1}*) is finitely generated, the HNN-relations form a finite set; moreover, since 
G^° (0, 1) is finitely presented (by teh above Theorem), the whole HNN-extension is a finitely 
presented group 

For the same reason as for G™° d3 (0, 1; Jf) in Section 7, we obtain: 

Lemma 9.29 The HNN-extension H(0, 1) is isomorphic to the subgroup (G™° d3 (0, l)U{K32i}) 
of the Thompson group G^i- 

In summary, we obtain Theorem 17.31 as well as the other main theorems, for G™° d3 (0, 1) and 
#(0,1). 

Section 8 shows that the word problem of #(0, 1) (over a finite generating set) is in coNP. 
9.4 Miscellaneous 

The following is a converse of Proposition 17.11 This converse gives an interesting property of 
G 3 n ° d3 (0, 1) (that G 3 n ° d3 (0, 1; #) does not have), but we make no use of it in this paper. 

Proposition 9.30 Let us abbreviate k^k^iO t° K - If 9 ^ ^3,1 i> s such that the conjugates of 
g under k or k~ x belong to G^i, then g G G 3 rL ° d3 (0, 1). In other words, 

Gf° d3 (0,l) = {g e G 3A : g^g*- 1 e G 3A }. 

Proof. Suppose g G G 3jl and g K ,g K 1 G G 3j i. 
Claim 1: g g Stab({0, 1}*). 

Proof of Claim 1: By contraposition we assume that g (£ Stab({0, 1}*), hence g- 1 (£ Stab({0, 1}*), 
and we will prove that g K ^ G^i. 

If g,g~ x Stab({0, 1}*) then (perhaps after replacing g by g^ 1 ), there is z G {0, 1, #}* — 
{0,1}* such that z G Dom(g) and g(z) G {0,1}*; let y = g(z). Let x = k~ l (z); note that 
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z G Dom(«; 1 ) since z contains the letter #. Then x contains # too, so x G Doiii(k); moreover, 
for all v G {0, 1, #}*, k(xv) = k(x) v = zv. For any w G {0, 1}* we have 

xw # K ( x ) w# = zw# g( z ) W # = yw# (G {0, 1}*#) 

We want to show now that domC(g K ) is infinite (when g K is maximally extended). Assume by 
contradiction that domC(g K ) is finite; so the elements of domC(g K ) have length < b for some 
constant b. 

Recall the definition of k and its relation with the permutation 737271 (•) of N, described in 
the beginning of the paper: 

737271 (•) = 

( ... I 6(j + l) I 6j I ... I 12 I 6 I 2 I 5 I 8 I .. . | Si + 2 | 3(i + 1) + 2 | . . . ) • 
( . . . I 6(7 + 1) + 3 I 6j + 3 I ... I 9 I 3 I 1 I 4 I 7 I ... | Si + 1 | S(i + 1) + 1 | ... )(•)• 
Therefore, the application of to yu># changes bit number S(i — 1) +2 to bit number 3i + 2 
of yw, for every i, < % < (\yw\ — 2)/3. Let us pick a w G {0, 1}* which is much longer than 6. 
Then g K (xwjf) (= K~ l (yw4f)) cannot we written in the form g K (xw#) = g K (uv#) = g K (u) v#, 
for any factorization of imj# as = uv# with \u\ < b. This contradicts the assumption 

that the elements of domC(g K ) have length < b. 

Therefore, g K does not belong to G 3; i. This proves Claim 1. 

From here on we can assume that g G Stab({0, 1}*). 
Claim 2: g g G^ d3 . 

Proof of Claim 2: By contraposition we assume that g £ G™ d3 , hence g~ x ^ G™° d3 ; we will 
prove that g K ^ G 3:1 . 

If g,g~ x $ G^' d3 then there is z G {0,1}* such that z G Dom(g), g(z) G {0,1}*, and 
\g(z)\ ^ \z\ mod 3. 

Since the action of k consists of permuting bits over a distance < 6 we have the following: 
There exist x, s G {0,1}*, with \s\ < 6 and |x| = \z\, such that k(xs#) = zt# for some 
t G {0, 1}* with \t\ = \s\. 

For any w G {0,1}* we have n(xsw4f) = zvw'#, for some v,w' G {0,1}* with \v\ = \s\, 
\w'\ = \w\, and where w' depends only on s and w (and not on x). Let y = g(z). Then we have: 

xsw# h-^-> zvw'# h-^-> (?(^) ™'# = yvw'# (G {0, 1}*#) 

k— > K _1 (yv w' 

where |y| ^ |x| mod 3, and |x| = |z|. 

Then K~ l (yvw'jf) = y'v'w"#, for some y',v',w" G {0,1}* with = |y|, |t>'| = and 
|u>"| = |ti/| = |u>|. However, since \y\ ^ |x| = \z\, it follows that w" differs from w in every bit 
position. 

We want to show now that domC(g K ) is infinite (when g K is maximally extended). Assume 
by contradiction that domC(g K ) is finite; so the elements of domC(g K ) have length < b for some 
constant b. Let us pick a w G {0, 1}* which is much longer than b. 

Since the application of g K to xsw# changes all the bits w, it follows that g K (xsw#) cannot 
we written in the form g K (xsw4j z ) = g K {uv#) = g K (u) v#, for any factorization of xsw as 
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xsw = uv with \u\ < b. This contradicts the assumption that the elements of domC(<7 K ) have 
length < b. 

Therefore, g K does not belong to £73,1. This proves Claim 2. □ 
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